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COMPETING EXPONENTIAL RISKS, WITH PARTICULAR 
REFERENCE TO THE STUDY OF SMOKING AND 
LUNG CANCER* 


JOSEPH BERKSON 
Mayo Clinic 
AND 


Lita ELvEBACKT 
Tulane University 


In various contexts of statistical problems one deals with the meas- 
urement of a “risk”—a risk of dying, a risk of becoming ill, and so forth. 
In applied vital statistics various indices are used to measure such 
risks, and typically these are not probabilities. In formal mathematical 
statistical analyses, typically they are measured as probabilities. A 
special problem arises when more than one risk must be measured at 
the same time. Even if they are independent in a technical statistical 
sense, the presence of one risk will complicate the measurement of 
another. This situation has been referred to as “competing risks.” In this 
paper we apply an analysis of competing risks to some questions that 
arose in the study of the relation of smoking to lung cancer. 


INTRODUCTION 


N SOME “prospective” studies, in which the death rates from various causes 
I among smokers were compared with the similar death rates among non- 
smokers, the “mortality ratio,” which, in the present application, is the ratio 
of the death rate among smokers to the corresponding death rate among non- 
smokers, was used to measure the “association” between smoking and death 
rates. The use of this index, in the way it had been employed in these investiga- 
tions, was criticized by one of the present authors [1] as misleading. Instead, the 
simple difference of the death rates as between smokers and nonsmokers was 
used for the analysis of the primary questions presenting themselves. 

Dr. Mindel Sheps, in a commentary article [17], extended the case against 
the mortality ratio. At the same time she criticized the use of the simple differ- 
ence of the rates as illogical.? On the basis of a heuristic development involving 
the idea of persons “exposed to risk,” she presented a different computation to 





* This investigation was supported in part by a grant from the National Science Foundation. 

t Now at The Public Health Research Institute of the City of New York, Inc. 

1 The difference of rates was used for analyses also by some statisticians reporting prospective studies, in par- 
ticular, Hammond and Horn [12]. 

2 She says, “... the difference... has little meaning unless it is related to the number of potential survivors.” 
This stricture is rather too severe. Sheps herself says (page 1211) “The absolute difference . . . describes concretely 
the number of persons affected... ” 
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measure the putative effect of smoking, and later [16] showed this to be a 
maximum likelihood estimate. Sheps dealt only with total deaths from all 
causes, and not with deaths from specific causes. It was to be noted that the 
formula she used had appeared in various statistical contexts, thongh it had 
not previously been developed as a maximum likelihood estimate. 

It appeared useful to develop similar formulas applicable to deaths from 
specific causes. Here again it was to be observed that, at least for one element 
of the problem, some relevant formulas had been developed, in this case by 
Neyman and associates, under the concept of “competing risks.” It occurred to 
us that it would be worthwhile systematically to elaborate the various formulas 
in a unified way, basing the development explicitly on the concept of competing 
risks. 

The first, part of the presentation develops the necessary formulas, in a 
standard way. The motive is expository. A considerable number of the formulas 
have already been published, and are to be found in the references listed at the 
end of the paper. 

Following this is presented a model for the effect of smoking in causing death 
from a specific disease, in which the parametric measure of the effect is in terms 
of the net probability of death from this disease. An example is given, and some 
comments are made, contrasting this measure with the mortality ratio. 


COMPETING RISKS 


We consider the model of exposure to two competing risks in the sense of 
Neyman [15] and take first a very simple example. 

Two marksmen shoot at a range of targets under conditions in which, if a 
target is struck, it instantly drops from view so that it cannot be struck again. 
Represent the striking rate of marksman 1, that is the probability of a hit when 
he is firing alone as q:, and similarly the rate of marksman 2 when he is firing 
alone as go. The probability when one risk operates alone is called the net risk 
or rate and is represented by lower case g; when it operates together with 
another risk it is called the crude risk or rate and is represented by capital Q. 

Suppose N targets are exposed and marksman 1 shoots first, followed by 
marksman 2: 


Rate for 1 is Q; = q1 (1) 
Rate for 2 is Q2 = (1 — qu)qz (2) 
Total rate is Q = Q: + Q2 = qi t+ G2 — G92 (3) 


Suppose marksman 2 shoots first, followed by marksman 1, then: 
Rate for 2 is Q2 = qz (4) 
Rate for 1 is Q; = (1 — qo)qi (5) 
Total rate is Q = qi + g2 — 9192 (6) 


It is seen that the total crude rate with both marksmen shooting is the same, 
whichever marksman shoots first, and assuming independence of the net proba- 
bilities gq: and gq this will be true in general. Regardless of the order of shooting 
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or whether the two marksmen shoot together, the total crude rate is given by 
(3), (6), which, of course, can be derived as the complement of the product of 
the probabilities, p,=1—q, and p2=1—Qq, of not being struck (survival rate). 

If, from independent trial, we know qi, the net rate of marksman 1, and have 
a record of Q, the crude rate when both shot together, we can derive the net 
rate gz from (3): 


_ Q@-u 
l—q 


q2 (7) 
But, suppose we did not independently know the net rate q: or q@, and knew 
from observation only the crude rates Q:, Q2 and Q=Qi+Q2, then we could not 
determine the net rates gq: and q: from these crude rates alone. We have seen 
that, with the same net rates q and q, although the total crude rate Q is inde- 
pendent of the order of shooting, the crude rates Q,, Qe for each marksman will 
depend on which marksman shot first. This problem of estimating a risk, when 
another independent risk is operating at the same time, is one that arises in 
different contexts in many statistical problems. 


THE INSTANTANEOUS RISK, THE NET RATE, AND THE CRUDE DATE 


In order to estimate the net q’s from the crude Q’s, something has to be known 
regarding the time relation of the risks. A simplifying assumption, which is fre- 
quently reasonable, is to suppose that each instantaneous risk® (called the “force 
of mortality” in actuarial texts) is constant over the period of observation, and 
that the risks are independent. We will use 8 to represent the instantaneous 
risk, and shift to the example of dealing with two mortality rates, namely q, 
the net mortality from some specified disease, and q2, the net mortality from all 
other diseases than 1, taken together and considered as a single risk. Then the 
net probability of death before ¢ from the respective causes is given by‘ 


Qu = 1 — Fi (8) 
Gor = 1 — e7 Fst (9) 
where (; is the instantaneous risk for net risk 1, 8: is the instantaneous risk for 


net risk 2, and ¢ is the time measured from t=0. From (8), (9) we have the cor- 
responding net probability of survival to time ¢ 


Pir = 1 — Qu = e*! (10) 
Po = 1 — qa = e F* (11) 


The probability of survival to time t, with both risks operating together, is the 
product of (10), (11) 





3 If lis the number surviving at time ¢, 

dl, d(In i) 
dt dt 
is the instantaneous risk at ¢. 

4 In some situations the known facts contradict the assumption of exponentiality of the net risks, as for instance 
the consideration of “withdrawals” as a competing risk in computing the T-year survival rate [4]. For the present 
purpose of computing relative competiug risks among different independent risks, at least when the risks are small, 
the assumption of a different function, instead of the exponential, makes little difference in the results. 
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P, — e— Bi+8)t = eft (12) 
and the probability of dying before ¢ is 
Q,=1-P,=1-—e%* (13) 


where 8 = 81+ fz. 

The formulas (10), (11), (12) represent “survival functions” in the context of 
actuarial discussions. 

It is seen, comparing (12) and (10), (11), that the crude rate with several 
net risks operating together has the same exponential form that defines the net 
rates, with the instantaneous crude risk equal to the sum of the instantaneous , 
net risks. As has been remarked previously, some functional relationship of the 
risk with time is needed for estimating the q’s if we are given only a knowledge 
of the Q’s. The exponential function is unique in that the function remains the 
same when a new risk is added (or removed). 

Without loss of generality, we can consider the period of observation as from 
t=0 to t=1 (OStS1). If observations are made over a period from t=0 to 
t=T, we can utilize the formulas given here by replacing the scale ¢ with a 
scale x, where x=t/T. 

The proportion of persons dying from cause 1 over the unit period from t=0 
to t=1, which is the crude death rate from cause 1, is 


. Bi Bi 
Q -f e*'8. dt = — |1 — e*| = —Q 14 
1 ; 16 B [ J a. (14) 
and similarly for cause 2 
o«hhw~enehe (15) 
B B 


and for all causes, the probability of death before the end of the period is 
Q=A+Q=1-e¢% (16) 
and the probability of survival to the end of the period is 
P=1-Q=e? 
The net death rates over the unit period are 
ga =l-p=l-—e* (18) 
q@ =1—p=1—e* (19) 


If, observing the crude rates Q:, Q2 and Q=Q,+Q:, we wish the net rates 
1, q2, these can be obtained from® 


In (1 — @) = = = n= Q) 20) 





* Suppose that the observed death rate per annum from all causes, computed as a probability, is Q, and the 
observed rate from some specific cause, say lung cancer, is Qi. We wish to know the total death rate if lung cancer 
as a cause of deatl, were eliminated. We may consider the death rate from all causes except lung cancer as a single 
death rate Q2 =Q—Q:. The desired rate is then obtained as gq: from (21). This is equivalent to the formula given 
by Greville [8] to compute the life table resulting from the elimination of specific causes of death. 
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In (1 — q) = - = Fin — Q) 


which are derived directly from (14), (15), (17), (18), (19). 


MAXIMUM LIKELIHOOD ESTIMATE, TIME ESTIMATES 


We will now consider that we are dealing with observations subject to random 
variation, and develop the maximum likelihood estimates and variances of the 
instantaneous risks and the net rates. 

Suppose we observe not only the number of deaths, but also the time when 
each death occurred. N individuals are observed over the unit period from 
t=0 to t=1. We have observed d, deaths at times tn, te, ---, ha, and de 
deaths at times to, too, - « * , fea,, and s=N—d,—d2 survivors to the end of the 
unit period. A death from cause 1 at time 4; contributes to the joint sample 
probability a factor Bie*4i, where 8=6,+2. Similarly a death from cause 2 
at time é; contributes a factor 6,e~°i. Each survivor contributes a factor e~8. 
The joint probability is proportional to 


o= By e bru Bo% ebzt2 e—Bs 


22 
= By4t e-Pi(Ztt#) gods g—Ba(Ztte), (22) 


From (22) it is to be observed that [d, (Zt+s) | is sufficient for 6: (with Be 
fixed), and [ds, (Zt+s) | is sufficient for 62 (with 6; fixed). It is of interest to note 
that for 6, with d, and s, it is Zt, not Yt, which is sufficient; similarly for 82, with 
dz and s, it is Zt, not Yt which is sufficient. In noting the time of death, it is 
irrelevant for the estimation of 81, 62, whether the death is from cause 1 or 
cause 2. 

From the joint probability of the set of observations (22), the maximum like- 
lihood estimates and their asymptotic variances are obtained in the standard 
way as derived from Fisher [6], [7]. The first derivatives 2 In ¢/08, and 
d In ¢/@f2 are set equal to zero, and the solutions for 6; and fy are the respective 
maximum likelihood estimates. The asymptotic variances are obtained from the 
inverse of the matrix of terms 

{ _ 0? In ‘ 
08:08; 


where St=24,4+2h2 


2 
o (asymptotic) = ——— (24) 


B2? + BiB2 


%, (asymptotic) = Na —e*) (25) 


For the estimate of any of the probabilities g, we can remember that, 
q=1—e-*, g=1—e~, var. g=(1—g)? var. B=p? var. £. 
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The maximum likelihood estimates (23) for 1, 82, are functions of the suffi- 
cient statistics for the respective parameters, following a general theorem for 
maximum likelihood estimates. However, the maximum likelihood estimates, 
in this case, themselves are not sufficient, since it is possible to have two sam- 
ples with the same maximum likelihood estimates, though the values of their 
sufficient statistics are not identical. 

If there is no particular interest in the mortality from specific diseases, but 
only in the total force of mortality 8=6:+ 2, then only the number of deaths, 
regardless of cause of death, and their times are required. The maximum likeli- 
hood estimate for 8 = 1+ 2, and its asymptotic variance are [5], [9], [14]: 

3 : (26) 
Blk scteracent. . 
zi+s 
B? 


NG _ e~8) ad 


a? (asymptotic) = 
where d=d,+de. 

The statistic [d, (Z¢+s) | is sufficient for 8, but, as with estimation of fi, Be, 
the maximum likelihood estimate is not sufficient. If, instead of truncating 
the experience at some time 7’, it is truncated when some fixed number r<N 
of deaths is observed, then the maximum likelihood estimate is sufficient [5]. 
It is to be noted that 


B - Ay + Bs, 
and this exemplifies the generality that the maximum likelihood estimate of a 
function is the function of the maximum likelihood estimate. Also 
d, A d, 


B —_ 6, Bs —_— > eee 28 
d d (28) 
So with 8 estimated from (26), 81, 82, can be estimated as in proportion to the 
respective number of deaths d, and d2, without computation from (23). 


FREQUENCY ESTIMATES 


We may now turn to the situation in which we have observed the number of 
deaths d; and d, from each respective cause, but not the times of death, and 
develop the maximum likelihood estimates based on the observed frequencies. 
The sample size = NV, d=d,+d, is the total number of deaths, and s= N —d,—d, 
is the number of survivors. We will recall that the probability of a death from 
cause 1 in the unit period is 


Q; = ato P, = ” (1 —_ e~8) (29) 


and for a death from cause 2 it is 


Be 
Q.=1-—P;=— (1—-—e-*) 
2 a é 
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and the probability of survival to the end of the unit period is 
P=e¢? 

The probability of the total sample is proportional to 


d, 2 d2 
+-[Ba-er} [Soo] 


From (32) the maximum likelihood estimates and their variances are 


(33) 


(34) 
2 
Bi(1 — e~*) mm 
2 
(1 — e-8 
+ a. =) (36) 
Be? 


Since, in applying (20), (21) to obtain 6, 62, one takes for Q;, Q2 and Q, the 
corresponding relative frequencies d,/N, d2/N, and d/N=1—s/N, the maxi- 
mum likelihood estimates (33), (34) are seen, in effect, to be the same as (20), 
(21) which were developed directly. 

For the estimate of 8, we write the joint probability as proportional to 


@ = (1 — e*)4 e-*, (37) 
From (37) we derive the maximum likelihood estimate and its variance 
8 = In (N/s) (38) 
o: (asymptotic) = tes (39) 
r) Ne-# 
As with the time estimates 
B= Bit 8: 
The probability of death in the period is 
Q=1-P=1-e* 


and therefore the variance (39) can be written 


a; (asymptotic) = 


If we think of estimating Q=1—e~ instead of 8, then 
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0 =d/N (42) 
» _ PQ 
= W (43) 


In the case of (43) the variance is exact instead of only asymptotic, (42) being 
the usual estimate of a binomial parameter. 


COMPARISON OF VARIANCES 


We have presented two sets of estimates, one based on the number of ob- 
served deaths and the times of these deaths—the “time estimate,” and the 
other based only on the number of deaths—the “frequency estimate.” Pre- 
sumably, the time estimate contains more “information” in regard to 8, and 
this should be reflected in a smaller variance of the estimate. We have only 
formulas for the asymptotic variances, but we believe that, in the present situa- 
tion, they probably are very good approximations of the exact variance for 
moderately large samples. Since 8= 81+ 2, it seems reasonable to assume that 
a comparison of the estimates for 8 should give a fair idea of how they compare 
for estimates of 81, Be. 

It is clear on inspection that the frequency estimate (38) 


8 = In (N/s) 


cannot be good for large 8, when the Q is close to 1. In this case the probability 
that s=0, for even fairly large N, will not be small, and for all samples with 
s=0 the estimate of 8 from (38) is not determinable. , 

In Table 1 are shown the relative variances of the two estimates for different 
values of Q. It is seen that for small Q<.05 the variance of the time estimate 
is virtually equal to that of the frequency estimate. For Q<0.6 the relative 
efficiency of the frequency estimate is greater than 0.9. Only with Q>0.9 does 
the efficiency fall below 0.5. Since the frequency estimate requires only the 
number of deaths and not their times and is easier to compute than the time 
estimate, it may be found satisfactory for use, except with very large Q. 


TABLE 1. COMPARISON OF VARIANCES. TIME 
ESTIMATE AND FREQUENCY ESTIMATE 








| N times the asymptotic variance 














Q pone “ Relative efficiency 
| Time estimate Frequency estimate 
.001 -0010 .0010 1.000 
.O1 .0101 .0101 1.000 
.05 .0526 .0526 1.000 
.10 -1110 -1111 .999 
.50 | .9609 1.0000 .961 
.60 1.3993 1.5000 .933 
.90 | 5.8910 9.0000 -655 
95 8.9744 19.0000 472 
.99 | 21.4218 99 .0000 -216 
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MEASURE OF THE MORTAL EFFECT OF SMOKING 


Sheps [16] has considered how the effect of smoking may be assessed from 
the analysis of “prospective” studies of the relation of smoking to death rates. 
On the basis of a heuristic development involving the idea of “exposed to risk,” 
she derived the maximum likelihood frequency estimate of the net risk, due to 
smoking, of death from all causes. For this particular estimate, no assumption is 
required regarding the time relation of the risks. We will consider the problem 
more in detail, particularly in regard to death from specific causes, on the basis 
of the development presented in the previous sections. 

As before, consider deaths as segregated in two classes, those due to some 
specific disease for which we take cancer of the lung as an example, and all 
other causes taken together and considered as a single cause. Nonsmokers are 
subject to deaths from “natural causes.” Smokers also are subject to death from 
natural causes but we assume that, in addition, they are subject to deaths from 
lung cancer caused by a specific carcinogen Y, and from other diseases caused 
by substances X, the substances Y and X being contained in tobacco smoke. 
We assume that these causes act independently, and that the net probabilities 
of death before ¢ (QStS1) in a unit period are given by 


Qiu = 1 — Ft (44) 

qo. = 1 — e Fst (45) 

Qu = 1 — e#it (46) 

got = 1 — e~ #2! (47) 
where q’1t, q’2 refer to net probabilities of death due to “natural causes”—from 
lung cancer and other diseases respectively, and qi:, 2: refer to net probabilities 
of death due to substances Y and X contained in tobacco smoke—from lung can- 
cer and other diseases respectively. 


Then, for the nonsmokers, the observed crude probabilities of death from 
lung cancer, other diseases, and all causes respectively are 


Qu = : [1 — e#"*] (48) 


B’2 
Qn =— 
2t g’ 


QW, =1-—e%"' (50) 


[1 — e-?"'] (49) 


where 6’ = 6’: +8’s. 

In the case of the smokers, there will be an increment of 6; and B: (which 
measure the force of mortality from lung cancer and other diseases due to sub- 
stances Y and X contained in tobacco smoke) in the instantaneous risks 8’; and 
8’. (which measure the force of mortality from natural causes). Let 8’1+6:=6u, 
B’2+B2=Be, Bu+Be2=8r. Then the crude probabilities among smokers, of 
death from lung cancer, other diseases, and all causes respectively, are 


(51) 
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Boo 
on = — [1 — er 52 
Q ra, eFr'] (52) 


Q: = 1 — e-Frt (53) 


N’ nonsmokers have been observed, of whom d’; have died from lung cancer 
at times ?t’n, t'12, - + « , tsa, and d’, from other diseases at times t’n, t's», +--+, 
t'ea’, and s’=N’—d’,—d’, have survived to the end of the unit period. NV 
smokers have been observed of whom d; have died from lung cancer at times 
tn, tia, > -**, ta, and dz from other diseases at times ta, too, +--+, tea, and 
s=N—d,—d, have survived to the end of the unit period. We wish to obtain 
the maximum likelihood estimates of 8’1, B’2, 61, 82, the corresponding net proba- 
bilities g, and also the crude probabilities Q of death from lung cancer and from 
other diseases, attributable to natural causes and attributable to smoking. 

The joint sample probability for the total sample of experience, the deaths 
from the several causes and their times among the nonsmokers, and also the 
corresponding observations among the smokers, can be set down, and the 
estimates and their variances derived in the standard way as illustrated in the 
previous sections. However, most of the estimates wanted may be had directly 
from those derived in previous sections, from the following considerations. 

Using the observations on the nonsmokers, the time estimates of 81, B’s are 
given by (23), and the frequency estimates by (33), (34). Similarly, from the 
observations on the smokers the estimates of 8 and B22 are given by the same 
formulas. 

Since the observations on the nonsmokers and the smokers are made on 
different samples the estimates 8, and f’; are independent, as are those of Az. 
and §’;. Then 


var. By = var. Bi; + var. 8’; 


var. Bs = var. Boo + var. B’>. 


As for the q’s and Q’s, these are functions of the §’s, and their estimates are 
the same functions of the estimates of the 6’s. The variances of the @’s and 
0’s are derivable from the variances and covariances of the ’s. 

The estimates and their variances for the chief parameters of interest are 
given in Table 2. 

Example.—We take, as an example of the application of the derived formulas, 
some data from the prospective study sponsored by the American Cancer 
Society and reported by Hammond and Horn [10], [11], [12]. Some 200,000 
men in the age range 50 to 70 years were interviewed and a statement obtained 
from each as to his smoking habits. Periodically, inquiry was made and it was 
ascertained when any individual had died, and the time and cause of death as 
stated on the death certificate were recorded. A report was made based on the 
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TABLE 3. DEATHS IN 44 MONTHS. AGE AT BEGINNING, 60-64 YEARS* 


























Nonsmokers Smokers Measure of effect of smoking 
n = 20,278 n=21,594 
Net 
Cause of death Deaths =d Deaths =d seers Diff. Mort. 
ue to ; b.t ti 
. d/n “ d/n smok- | Prob. ratio 
_— Prob.t mm Prob. t ingt 
Lung cancer 10 .00049 103 .00477 | .00449 | .00428 9.7 
Other cancer 218 .01075 325 .01505 | .00469 | .00430 1.4 
Coronary disease 552 .02722 921 .04265 | .01653 | .01543 1.6 
Other diseases 486 .02397 667 .03089 | .00765 | .00692 1.3 
Total: All causes | 1,266 .06243 | 2,016 .09336 | .03299 | .03092 1.5 


























* Data and categories of Hammond and Horn; nonsmokers =never smoked cigarettes regularly; smokers =all 
regular cigarette smokers. 

+ These are, of course, estimates. Their standard errors are, with such large numbers as are involved in these 
investigations, small; the net probability due to smoking is “statistically significant,” at level P <.001 for each of 
the categories of death, 


status of each individual as of 44 months after the initial inquiry. In Table 3 
are shown the essential data for the group of men 60-65 years of age at the 
time of the original inquiry. The binomial estimates of the probability of death 
in the 44-month period of follow-up are shown for the nonsmokers and for the 
smokers, and for each of four categories of cause of death, namely cancer of the 
lung, other cancer, coronary-artery disease, and other diseases, as well as for 
deaths from all diseases. In the last three columns are shown three indices of 
the effect of smoking in increasing the probability of death from each of the 
categorized causes. The first of these indices is the estimate of the net probabil- 
ity of death from the respective causes, using the frequency maximum likeli- 
hood estimate. The second is the simple difference of the probabilities of death 
of smokers and of nonsmokers. The third column gives the “mortality ratio,” 
which here is the ratio of the probability of death among smokers to the 
probability among nonsmokers. 

If we use the net probability of death as the measure of the effect of smoking 
in respect of a cause of death, we see that, among the four categorized causes of 
death, smoking has the greatest effect in increasing deaths from coronary heart 
disease, the next greatest from diseases in the class “other diseases,” the next 
from cancer other than lung cancer, and the least from lung cancer. If we use the 
simple difference of the probabilities of death, shown in the next column, we 
reach essentially the same conclusion. This is not surprising, since it is easily 
shown that if the probabilities of death Q, q are small, the net probabilities are 
given with close approximation by the simple difference of the probabilities 
among smokers and nonsmokers. On the basis of either of these estimates, the 
effect of smoking in causing death from lung cancer is only about 14 per cent 
of the effect in causing death from all diseases, that is, the effect of bringing on 
death from diseases other than lung cancer is more than six times the effect of 
causing death from lung cancer. 
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If we use the “mortality ratio” shown in the last column, a quite different 
impression is obtained. The impression then is that for each of the categories 
other than lung cancer, the effect of smoking is about the same, and, compared 
with the effect on lung cancer, is negligible. We see that the mortality ratio is 9.7 
for lung cancer, while it is less than 2 for each of the other categories. In one 
important report of a prospective study, [3] a ratio of less than 2 was considered 
as hardly worth commenting on as physically significant. If this view is applied 
to the statistics of the table presented, it would be said only that these data 
are evidence that smoking causes lung cancer. Indeed, this is the way these and 
similar findings in other prospective studies have been interpreted. In a number 
of published articles dealing with the effect of smoking, which depend largely 
on these prospective studies, only lung cancer is mentioned! The finding of 
associations of smoking with classes of disease in general, not only, or even 
chiefly, with lung cancer, is of central importance for an interpretation of the 
possible biologic significance of these statistical studies [2], [13]. 
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A NONPARAMETRIC SUM OF RANKS PROCEDURE FOR 
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A nonparametric procedure is presented to test the null hypothesis 
that two independent samples come from the same population against 
the alternative hypothesis that the samples come from populations 
differing in variability or “spread.” Extensive tables of critical values 
are included for n1<n2:<20. Large sample procedures are presented 
which include a correction for tied observations. The test is entirely 
distribution-free under the usual randomization procedures against 
the null hypothesis that the two distributions are identical. The ab- 
sence of any normality assumption is a particularly important feature 
of the test, because its parametric alternative, the F test for variance 
differences, is quite sensitive to departures from normality. The test 
has the additional advantage of being directly applicable to non- 
numerical ordinal data. 


HE procedure developed and discussed in this paper offers a nonparametric 
pee of the null hypothesis that two independent samples come from the 
same population, a test which has considerable power against the alternative 
hypothesis that the samples come from populations differing in variability or 
“spread” but which is relatively insensitive to alternative hypotheses that the 
two populations differ mainly in location. The test is sensitive to differences in 
variability when the location parameters, e.g. means, of the populations are 
equal or nearly equal. 

Tables of critical values for samples of sizes <20 and single-tail probabilities 
of 0%, 0.1%, 0.5%, 1%, 2.5%, 5% and 10% are given. The two sample sizes 
may be equal or different. These tables are equally applicable to the Wilcoxon 
test [18, 20]. Convenient approximations for cases falling outside the scope of 
the tables are discussed. 

The absence of any normality assumption is a particularly important feature 
of this test because its parametric alternative, the F test for differences in vari- 
ances, is quite sensitive to departures from normality. This was first pointed 
out in 1931 by E. S. Pearson [16], and his findings have been confirmed by 
Geary [10], Finch [6], and Gayen [9]. Box [3] has shown that the sensitivity 
to departures from normality may be even greater when more than two vari- 
ances are compared by Bartlett’s homogeneity of variance test. 

A number of attempts have been made recently, notably by Lehmann 
[13], Mood [15], and Sukhatme [19], to develop nonparametric tests of 
variance differences. The procedure suggested by Lehmann, however, has been 
shown (by Mood and by Sukhatme) not to be distribution free. The test 
proposed by Mood assumes knowledge of the means of the two populations, 
whereas in practice such knowledge rarely exists. The procedure suggested by 





1 This paper reports work done when the authors were Fellows at the Center for Advanced Study in the Be- 
havioral Sciences at Stanford, California, 1957-58. 
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Sukhatme also assumes knowledge regarding the location of the two popula- 
tions, and in addition this procedure is not as efficient as the one proposed by 
Mood. Procedures proposed by Kamat [11] and Rosenbaum [17] seem likely 
to lack power. Ansari and Bradley [1] have recently developed a sum of ranks 
test for dispersion. Additional work on this topic has been presented by Freund 
and Ansari [8] and by David and Barton [5]. 

To illustrate the test procedure we propose, we may consider data from a 
study discussed elsewhere ({18], pp. 148-51) in which the amount of aggres- 
sion attributed to film characters by members of two populations, A and B, 
was compared. 

Sample from A: 25 5 14 19 0 17 15 8 
Sample from B: 12 16 6 13 13 3 10 10 11 


In the present procedure, the observations are first combined in a single series, 
in order of increasing (or decreasing) value, retaining their identification as 
a’s or b’s: 


Score: Sot eeéets www fi I 13 13 14 15 16 17 #19 25 


9 
Sample: Sete tp aa & ee 8B RP 2&2 eS +e 6 2 « 


(When ties occur between two observations from the same sample, as in the 
above series, the order in which we arrange the observation does not matter. 
When observations are tied across samples, more caution is required. The 
whole question of tied observations is discussed in a later section of this paper.) 

If we were interested in testing the null hypothesis that these samples are 
from the same population or from populations with identical distribution func- 
tions against the alternative that the populations differ in location, we might 
assign ranks 1 to 18 to this ordering in a direct manner, from lowest to highest, 
determine the sum of the ranks of the a’s (or b’s), and then proceed with a rank 
sum test of a Wilcoxon type [20], one version of which is the Mann-Whitney 
U test [14]. 

However, since we are interested in testing the null hypothesis against the 
alternative that the populations differ in variability, we shall assign ranks 1 to 
18 to the ordered observations in a different manner, one which assigns low 
ranks to extreme observations and high ranks to central observations. We 
assign rank 1 to the lowest number of the sequence, ranks 2 and 3 to the two 
highest members in the sequence, ranks 4 and 5 to the next two lowest, etc. 
(If the total number of observations is odd, the middlemost score is dropped— 
not assigned a rank—in order that the highest assigned rank will be even.) 
This ranking procedure is illustrated with the data already presented: 


Score: O9SEGCE SFC MNMWHUNUB BM YE BW WW Ww 2 
a ae a oe oe oe ee en ee ee ee a ee ee a oe a 
Rank: oe me a oe: oe: oe ee ee ee ee: ee ee 


Assigning the ranks in this way puts the lower ranks at the extremes of the 
ordered sequence and the higher ranks in the middle of the sequence. If the 
null hypothesis were true, the observations from the two populations would 
tend to be well mixed, so that the mean rank assigned to one of the samples 
would tend to equal the mean rank assigned to the other sample. If, on the 
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other hand, the alternative hypothesis were true, we would expect more of the 
observations from the population with greater spread to be near the extremes 
of the ordered sequence and, therefore, to be assigned the lower ranks, and we 
would expect more of the observations from the less variable population to be 
near the middle of the sequence, and, therefore, to be assigned the higher ranks. 
Thus we would expect the mean rank assigned to the observations from the 
more variable population to be considerably smaller than the mean rank as- 
signed to the observations from the less variable population. 

Since the total sum of ranks is fixed, we may work with the sum of ranks for 
either group. If the two groups are of different size, arithmetic convenience 
causes us to choose the sum of ranks for the smaller group. In the example pre- 
sented above, the sum of ranks for the a’s is R,=59. (Had we used the same 
ranking procedure but started with 1 for the highest value, 2 and 3 for the 
lowest two values, etc., we would have found R,’=60.) 

One reason for choosing the above ranking procedure is that it allows the 
use of the same tables as are used for the Wilcoxon rank sum test for location. 
Thus our procedure may be considered a Wilcoxon test for spread in unpaired 
samples. 

The standard tables apply directly and precisely to either way of ranking 
the observations so long as it is chosen in advance of seeing the observations. 
As a consequence , when the standard tables are applied to the mean of the two 
rank sums, 3(R,.+2,’), the result may be shown to be slightly conservative. 


PROCEDURE WHEN NEITHER SAMPLE CONTAINS 
MORE THAN TWENTY OBSERVATIONS 


Let 1: <m be the sizes of the two samples. Call the sample of size n, the first 
sample. Let R, be the sum of ranks for the first sample. Table 1 gives critical 
values of R, for nm <n2<20 and single tail probabilities 0%, 0.1%, 0.5%, 1%, 
2.5%, 5%, and 10%.” As was noted above, these tables are equally applicable 
to the ordinary (Wilcoxon, Mann-Whitney, White, Festinger) rank sum pro- 
cedure for relative location of two unpaired samples. The null-hypothesis 
probability of obtaining a smaller-sample rank sum less than or equal to any 
entry in the left half of the table is not greater than the smaller percentage at the 
head of the column containing the entry. The null-hypothesis probability of 
obtaining a smaller-sample rank sum greater than or equal to any entry in the 
right half of the table is not greater than the smaller percentage at the head of 
the column containing the entry. Where the least extreme rank sum which is 
significant at a given tabular level is also significant at a more extreme level, 
this fact is indicated by the entry of an arrow. (The parenthetic values for 0% 
and 100% are, as these statements imply, those values which are just barely 
impossible.) The tabular values are appropriate linear transformations of the 
values presented by Auble [2]. 

In the example discussed earlier, nx=m=9, and Ri: =59. Reference to the 
appropriate table shows this value to be significantly low at a one-tailed 1% 
level, but not at a one-tailed 0.5% level. Since low values correspond to a 





2 The proposed test is not recommended for ni =1. The tables include values for m =1 for purposes of complete- 
ness. 
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greater spread for sample A, we can conclude with 99% confidence that popu- 
lation A is more spread out than population B. 


PROCEDURE WITH TWO LARGE SAMPLES 


Taking the mean and variance of R,, from the Appendix, and making a con- 
tinuity correction, it is appropriate to treat 


2R, — n(n, oh Ne -{- 1) + 1 

Vni(n1 + ne + 1)(n2/3) 
as a unit approximately normal deviate (under the null hypothesis), provided 
we choose that sign in the numerator which makes the magnitude of z smaller. 


In the example, where the sample sizes are not really large, this procedure 
yields 





(1) 





2160) — 171 +1 


= — 2.290 


which falls at the one-sided 1.1% point, in reasonable agreement with our 
previous result. 


PROCEDURE WITH BADLY UNBALANCED SAMPLES, 
AT LEAST ONE LARGE 


If one sample is small and the other is large, the quantity z of (1) above may 
not be sufficiently approximately normally distributed. If such is the case we 
may find it desirable to adjust z so as to improve the adequacy of the 
approximation. 

Two of the possible adjustments are discussed in the Appendix. The recom- 
mended choice is 


1 1 
Z2= 2+ —.-—.} s° — 3s (2 
(— 10n2 ( 


which is shown to perform quite well for rather extreme conditions. 

Outside the range of the present tables, z will usually be adequate, but z. may 
be easily calculated in cases of doubt. When mn, =2 and n, is large, direct calcu- 
lation of the probability may be advisable, using , with any integer ji, 


Prob (R, < 27) = r 
Ne 
(when n; = 2) 


2)? 
bbe <44 ho 
(na + 1)(t2 + 2) 
TREATMENT OF TIES 


If two tied observations come from the same sample, treating them as ar- 
ranged in either order does not affect the values of R;. It is thus possible to 
neglect such ties in calculating A. 
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If two observations from different samples are tied, “breaking” the tie in the 
two possible ways would lead to different values of R:. It is, therefore, probable 
that we should, in such cases, wish to assign average ranks to the tied observa- 
tions. If we do this, we shall presumably have to allow for its effects in assessing 
significance. 

A rather similar situation arises if there are no ties but we wish to average 
the two sums of ranks obtained by starting, respectively, with the lowest and 
highest observations. We shall then assign to two observations the rank 1.5, 
to two others the rank 3.5 and so on. 

If both samples are atleast moderately large, in any of these cases we need 
only correct the variance of R; and continue to use the modified z which results. 
This change in variance depends, as we show in the Appendix, only on the 
reduction in sums of squares of ranks assigned, and may be compensated for 
by using as the denominator of z in (1) 


Vni(ny + Ne + 1) (2/3) _ 4|nyno/ (m1 + Ne) (ny + i 1)](S; -— S2) (3) 
where S; is the sum of squares of the ranks (not averaged) of the tied observa- 
tions, and S, is the sum of squares of the averaged ranks of the tied observations. 

The reduction in sum of squares may be simply and directly computed. Re- 
ferring again to the example, where tied pairs of observations were ranked 9 
and 12, 13 and 16, and 14 and 15 respectively, if we were to replace these by 
the corresponding average ranks—10.5 and 10.5, 14.5 and 14.5, and again 14.5 
and 14.5, the contribution of these values to the sum of squares would change 
from 





S, = 9% + 12? + 13? + 16? 4+- 14” + 15? = 1071.0 


S. = 10.5? + 10.5? + 14.5? + 14.5? + 14.5? + 14.5? = 1061.5 


the reduction thus being 9.5, which would alter the denominator of z from 4/513 
to +/502.94 and would thus increase the magnitude of z from —2.290 to —2.319. 

While this particular change is not of any great magnitude, we clearly ought 
not to have a choice as to whether or not we accept it. If we are going to use 
average ranks for ties coming from different samples, and if we wish to maintain 
the permutation nature of our procedure, we should use average ranks for all 
ties. Since using average ranks for ties within a single sample does not affect 
R,, since its effect on z is always to make it more significant, we may, if we wish, 
neglect to allow for the use of average ranks within samples, but our significance 
statements will then be more conservative than necessary. 


DISCUSSION 


The application of the test we have described does not require any assump- 
tion that the distribution(s) from which the samples were drawn is normal or 
any other specific shape. Against the null hypothesis that the two distributions 
are identical, the test is entirely distribution-free under the usual randomization 
procedures. 
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The test has the additional advantage of being directly applicable to non- 
numerical ordinal data. Observations which are ordered but not measured on 
any scale arise frequently, especially in research in the behavioral sciences. 

It should be noted that in the presence of differences between population 
means the test tends to lose its sensitivity to differences in variability. The 
larger the difference in means, the lower the probability that the null hypothesis 
will be rejected even in the presence of differences in variability. That is, the 
larger the difference in means, the larger the probability of a Type II Error. 
This is especially true, of course, when the variabilities are small. In the case 
where the populations are disjoint (have no overlap), the power of the test is 
zero. 

It should also be noted, however, that differences between means, in every 
instance so far examined, decrease the probability of a Type I Error. In general, 
rejection of the null hypothesis can almost certainly be attributed to differences 
in variability. 

The power-efficiency of the test against normal alternatives has not been in- 
vestigated as yet. Inasmuch as the two-sample F text for variance differences 
loses both efficiency and validity rapidly in the presence of non-normality, the 
test proposed here should serve as a useful and safer alternative when little is 
known about the population distributions. 


TABLE 1. TABLES OF EXACT CRITICAL VALUES FOR R, 


R,=sum of ranks for smaller sample, n:=size of smaller sample, n2=size of larger 
sample. One-tail % given above blocks; two-tail % given below blocks. (Also applicable 
to the Wilcoxon test [18, 20] in its original form.) 
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TABLE 1—(cont.) 
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TABLE 1—(cont.) 
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TABLE 1—(cont.) 


n, =6 
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n,=9 
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162 | 107 | 113 | 120 | 127 
105 | 110 | 117 | 122 | 131 
108 | 113 | 121 | 127 | 135 
111 | 116 | 124 | 131 | 139 
114 | 119 | 128 | 135 | 144 
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TABLE 1—(cont.) 
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TABLE 1—(cont.) 



























































































































































































































































































































































n= 16 
100 |99.9|99.5| 99 |97.5] 95 | 90 %— 10 5 | 2.5} 1] 0.5] 0.1] 0 
= 0{| 0.1] 0.5} 1] 2.5} 5 | 10 -% 90 | 95 |97.5| 99 |99.5|/99.9] 100 
16 135 | 184 | 196 | 202 | 211 | 219 | 229 299 | 309 | 317 | 326 | 332 | 344 | 393 
7 135 | 188 | 201 | 207 | 217 | 225 | 235 309 | 319 | 327 | 337 | 343 | 356 | 409 
18 135 | 192 | 206 | 212 | 222 | 231 | 242 318 | 329 | 338 | 348 | 354 | 368 | 425 
19 135 | 196 | 210 | 218 | 228 | 237 | 248 328 | 339 | 348 | 358 | 366 | 380 | 441 
20 135 | 201 | 215 | 223 | 234 | 243 | 255 337 | 349 | 358 | 369 | 377 | 391 | 457 
% 0 | 0.2 1 2 5| 10] 20 — 20| 10 5 2 1 | 0.2 0 
nm =17 
‘ 100 | 99.9|99.5] 99 |97.5) 95 90 %- 10 5 2.5 1 0.5} 0.1 0 
ws 0} 0.1] 0.5] 1] 2.5} 5 | 10 -% 90 | 95 |97.5] 99 |99.5]/99.9] 100 
17 152 | 210 | 223 | 230 | 240 | 249 | 259 336 | 346 | 355 | 365 | 372 | 385 | 443 
18 152 | 214 | 228 | 235 | 246 | 255 | 206 346 | 357 | 366 | 377 | 384 | 398 | 460 
19 152 | 219 | 234 | 241 | 252 | 262 | 273 356 | 367 | 377 | 388 | 395 | 410 | 477 
20 152 | 223 | 239 | 246 | 258 | 268 | 280 366 | 378 | 388 | 400 | 407 | 423 | 494 
% 0 | 0.2 1 2 5] 10] 20 “+ 20] 10 A 2 1 | 0.2 0 
ny =18 
100 |99.9/99.5| 99 |97.5] 95 90 %- 10 5 2.5 1 0.5; 0.1 0 
0| 0.1] 0.5} 1] 2.5] 5] 10 —-% 90 | 95 |97.5| 99 |99.5/99.9] 100 
——————E EE Oe . EE | 
18 170 | 237 | 252 | 259 | 270 | 280 | 291 375 | 386 | 396 | 407 | 414 | 429 | 496 
19 170 | 242 | 258 | 265 | 277 | 287 | 299 385 | 397 | 407 | 419 | 426 | 442 | 514 
20 170 | 247 | 263 | 271 | 283 | 294 | 306 396 | 408 | 419 | 431 | 439 | 455 | 532 
o olo2} 1] 2] 5] 120! 2] «> | 20| 10] 5 | 2} 1 | 0.2 0 
nr =19 
100 |99.9]99.5] 99 |97.5| 95 | 90 %— 10 5 | 2.5] 11] 0.5] 0.1] oO 
= 0} 0.1} 0.5) 1] 2.5) 5 | 10 -% 90 | 95 |97.5| 99 |99.5|99.9] 100 
a ——— —— = — 
19 189 | 267 | 283 | 291 | 303 | 313 | 325 416 | 428 | 438 | 450 | 458 474 | 552 
20 189 | 272 | 289 | 297 | 309 | 320 | 333 427 | 440 | 451 | 463 | 471 | 488 | 571 
% | oj|o2} 1] 2] 58] 10 | 20; 20/ 10|/ 5/ 2] 1alo2] o 
nm =20 
t 100 | 99.9|99.5| 99 |97.5] 95 | 90 %— 10 5 | 2.5] 1] 0.5] 0.1] Oo 
. 0} 0.1] 0.5] 1] 2.5] 5 | 10 -% 90 | 95 |97.5] 99 |99.5|99.9] 100 
20 | 209 | 208 315 | 324 | 337 348 | 361 459 | 472 | 483 | 496 | 505 | 522 | 610 
%, 0 | 0.2 1 | 2 5 10 | 20 | - 20] 10 5 2 | 1] 0.2 0 
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APPENDIX 
Test Procedure for Large Samples 


To develop a general test procedure, suppose that samples of sizes mn; and nz 
are drawn from identical populations and are ranked jointly in the special 
manner described earlier. Then the sum of the N =,+ integers is }N(N +1) 
and the sum of their squares is {N(N+1)(2N+1). It follows that the mean 
and variance of the N integers are 


l 
w= —(N +1) (1A) 


| 
c= 2 (N?2 — 1) (2A) 


If mn; and nz are not too small, the means, R,/m, of samples of size n, randomly 
drawn without replacement from the N integers will be approximately normally 


distributed with mean equal to uw and variance equal to 

2 o N -_— ie 

Rik eS 
, ny N- 1 


(3A) 


and thus 





_ Rif — aw Ri/n; — 3(N F 1) 


as (4A) 
TRi/ny V(N? — 1)(N — m)/12ni(N — 1) 


2 





will be approximately normally distributed with zero mean and unit variance.* 
Transforming (4A) so that it is expressed in terms of Ri, the sum of the sam- 
ple of n, observations, and correcting for continuity we have 


2R; - n(n +n2+1) +1 


z= (1) 
Vni(n, + ne + 1)(n2/3) 








Correction for Tied Observations 


If average ranks are assigned to tied observations, the mean p= }(N+1) is 
not affected. However, the variance is reduced to less than 3(N*—1). Since the 
variance is obtained by subtracting the square of the mean from the mean of the 
squares of the N consecutive integers, and since the sum of the average ranks 
is not changed, it follows that the change in variance depends only on the dif- 
ference between the sum of squares of the ranks (not averaged), Si, and the 
sum of squares of the averaged ranks, S.. Thus the variance o? is reduced by 


and becomes 





* Mann and Whitney [14] show that as mi, nz increase in size, the sampling distribution of the statistic U (which 
is linearly related to 21) rapidly approaches the normal distribution. 
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S, — Se 
(5A) 


Substituting the right side of (5A) for the required population variance o? in 
(3A) we get for the denominator of z 


Vni(ni + nz + 1)(n2/3) — [4nyno/(ny + n2)(ny + nz — 1)4(S: — S2) (3) 
Procedure for Badly Balanced Samples 
Two approaches to improving the approximation have been explored. The 
first, the Cornish-Fisher [4] form of the Edgeworth expansion, discussed by 
Kamat ({11], Chap. 6) is 
Y2 
- (23 — 32) (6A) 
24 


where 72 is a relative cumulant of the distribution of R,. It is shown below that 
the distribution of R, has relative semi-invariants (relative cumulants), 


y =0 
6 no?(nm2 + 1) — ny2(ny + 1) 


V2 = 


5 Neny(Ne + ny + 1)(n2 — 14) 


The second approach is an empirical approximation, developed to roughly 
minimize the discrepancy between the desired (exact) values and the approxi- 
mate values of the adjusted z’s, namely 


l l 
gs, = 2+ ( ion Vee — 32). (2) 
10n,; 10n» 


More refined approximations, in the original Edgeworth form giving cor- 
rections to the probability for a given value of z, are given by Fix and Hodges 
[7, pp. 312-3], who noted the great improvement due to the use of the first 
corrective term. 

The adequacy of the two approximations considered here can be tested by 
applying them to the most extreme values for which the tables give exact sig- 
nificance levels, namely n= 2 and 3 for nz=20, and also for n.=10. The results 
are shown in Table 2. 

The quality of performance for these four extreme cases can be summarized 
as follows: 

(1) The use of z itself for | exact 2| <2.0 leads to a maximum error of 0.09 
and a mean absolute error of 0.03. 

(2) The use of the empirical correction for | exact z| <2.5, omitting the one 
instance ny =2, m.=20, Ri <4, leads to a maximum error of 0.05 and a mean 
absolute error of <0.02. 

Most situations outside the scope of the tables presented here will be ade- 
quately treated by the use of unadjusted z. In case of doubt the empirical ad- 
justment should be sufficiently accurate except possibly for n;=2, where the 
precise values are easily calculated. 
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TABLE 2. COMPARISONS OF THE APPROXIMATIONS WITH EXACT 
VALUES FOR SPECIFIED UNBALANCED SAMPLES 











Corresponding z Values 





Exact 
Ri Proba- Normal |Cornish-Fisher| Empirical 
bility Approxi- Approxi- Approxi- 
mation mation mation 
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.003496 
.006993 
.01399 
.0245 
-0385 
.0559 
-0804 























Moments and Cumulants 


Mann and Whitney [14] give the second and fourth moments of the statistic 
U which is linearly related to R by 


R. 


n(n + 1) 
——— — 


U = NyNe 


They show that if 
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bape nyno(ni + m2 + 1) 
E(u?) = — — 


and 


E(u‘) = mama(n +m = ) (5n2?n, + Snen? — 2n2? — 2n,? 
240 
+ 3nen, — 2n2 — 2n)). 

Thus the moment coefficients and relative cumulants are 

i= =90 

B2=3+ 72 
where 

_E (u‘) — 3[E(uv? )|? 


| E(u?) |? 


na(n: oo 1) — ny*(nyq - 1) 


5 ; NN; (Ne + ny + 1)(n2 — fi) 


(For an evaluation of the sixth moment see Fix and Hodges, |7, p. 312]). 
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STATISTICAL EVALUATION OF CLOUD SEEDING OPERATIONS *f 


K. A. BRowNLEE 
University of Chicago 


Modern commercial cloud seeding dates from the discovery in 1946 
that silver iodide crystals are good nuclei for the formation of precipita- 
tion from cold clouds. Whether silver iodide seeding does actually in- 
crease the precipitation over what would occur naturally has been a 
controversial question. 

In 1953 Congress established the Advisory Committee on Weather 
Control, one of its responsibilities being to determine whether the 
United States should experiment with activities designed to control 
weather conditions. In its Final Report, 1957, the Committee concluded 
on the basis of historical records that in mountainous areas com- 
mercial seeding produced an increase of 10 to 15 per cent in precipita- 
tion. The methodology of the Committee’s analysis is reviewed in this 
paper, and it is concluded that possible selectivity on the part of the 
seeding operator could produce fictitious results. The Committee’s 
Report denies the existence of such selectivity but there is reason to 
doubt the denial. In the reviewer’s opinion the Committee’s claim for 
the positive effects of commercial seeding must be regarded as unproved. 


INCE man’s earliest civilizations he has on occasion wished to increase the 
rainfall to increase the yield of his crops. In fact, entire civilizations have 
crumbled in consequence of a secular decline in rainfall. In our present civiliza- 
tion, weather modification, if it is possible, would be of great economic im- 
portance. For example, hurricanes, droughts, floods, and hail storms cause 
economic loss. Those who live or own property in low-lying territory liable to 
flood would be happy if floods could be prevented, those raising crops in areas 
of notoriously inadequate rainfall would like the precipitation increased. In 
some instances we can affect the weather by sheer brute force. Fog has been 
dispersed locally by the burning of fuel oil to permit the landing of military 
aircraft. Presumably, exploding a large atomic bomb in a small thunderstorm 
would bring that thunderstorm to a close. However, what is relevant is whether 
changes in the weather can be made economically, in the sense that the benefits 
outweigh the costs. 

Modern interest in weather modification, specifically increase in precipita- 
tion, dates from 1946 when V. J. Schaefer dropped dry ice into supercooled 
clouds over Massachusetts and observed that holes were formed in the clouds. 
In the same year, B. Vonnegut showed that in laboratory cold boxes silver 
iodide crystals were good nuclei for the formation of ice crystals. The relevance 
of this is that, according to the Bergeron-Findeisen theory, the formation of ice 
crystals is an essential step in the formation of precipitation from cold clouds. 
A large amount of government-sponsored research followed. By 1948 commer- 
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cial operators claiming to increase precipitation, usually with ground based 
silver iodide generators, were in business. Their claims were disputed by some 
meteorologists and in 1953 Congress set up the Advisory Committee on 
Weather Control. The Committee consisted of representatives of the Depart- 
ments of Agriculture, Commerce, Defense, Health (Education, and Welfare), 
and the Interior, and the National Science Foundation, along with five “per- 
sons in private life of outstanding ability in the fields of science, agriculture, and 
business.” Section 3 of the act reads: 

“The Committee shall make a complete study and evaluation of public and 
private experiments in weather control for the purpose of determining the ex- 
tent to which the United States should experiment with, engage in, or regulate 
activities designed to control weather conditions.” 

The two volumes under review here form the Committee’s Final Report. 

Volume I contains a Summary of Findings, of which the most important is 
that the Committee concluded: 

“(1) The statistical procedures employed indicated that the seeding of win- 
ter-type storm clouds in mountainous areas in western United States produced 
an average increase in precipitation of 10 to 15 per cent from unseeded storms 
with heavy odds that this increase was not the result of natural variations in 
the amount of rainfall. 

(2) In nonmountainous areas, the same statistical procedures did not detect 
any increase in precipitation that could be attributed to cloud seeding. This 
does not mean that effects may not have been produced. The greater variability 


of rainfall patterns in nonmountainous areas made the techniques less sensi- 
tive... 
(4) Available hail frequency data were completely inadequate for evalua- 


tion purposes... ” 


The Recommendations of the Committee do not include any recommenda- 
tion that one or more properly designed and excuted experiments should be run 
to test whether commercial seeding techniques have any effect in increasing 
precipitation. Since in its Findings the Committee had already concluded that 
commercial seeding techniques are effective in mountainous areas, they might 
have felt that such a recommendation for mountainous areas was inappropriate, 
but their findings had specifically given no conclusion for nonmountainous 
areas and it might have been justifiable to reeommend further work in such 
areas. 

The remainder of Volume I summarizes the historical background, the ac- 
tivities of the Committee, and the papers on the statistical and physical evalua- 
tion which form the bulk of Volume II and which are the basis for the Findings 
quoted above. 

In this paper we are not directly concerned with the fifteen papers in Volume 
II on the physical evaluation program, nor with the paper on the legal situa- 
tion. The section on the statistical evaluation program includes six technical 
reports. The most important from a statistical and practical viewpoint are 
Nos. 1 and 2, both by H. C. 8. Thom, on “A Statistical Method of Evaluating 
Augmentation of Precipitation by Cloud Seeding,” which sets forth an analysis 
closely related to a covariance analysis, and “An Evaluation of a Series of 
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Orographic Cloud Seeding Operations,” which applies the techniques of Report 
No. 1 to several sets of historical data. These reports will be discussed later in 
this review. Report No. 3, “The Frequency of Hail Occurrence,” by H. C. 8. 
Thom, concludes that the distribution of the occurrence of hail follows a nega- 
tive binomial distribution and discusses the estimation of the parameters. Re- 
port No. 4, “A Method for the Evaluation of Hail Suppression,” by H. C. S. 
Thom, develops a sequential test for the situation in which the underlying dis- 
tribution is the negative binomial. No. 5 by Gerald D. Berndt, on “An Evalua- 
tion of Commercial Cloud Seeding Operations Conducted During the Summer 
Months in South Dakota” introduces the concept of a cost/benefit ratio and a 
Bayes approach. It appears to use the average rain for those days in which rain 
did fall, rather than for all relevant days. Thus, if seeding completely sup- 
pressed rain on certain days, this effect would not show up in the average con- 
sidered. No. 6, by Max A. Woodbury, is concerned with prediction of what 
would happen in the absence of seeding, using relative humidity and similar 
variables. For monthly precipitation in South Dakota relative humidity gave 
a correlation coefficient of 0.614 and the addition of two more sophisticated 
variables raised this to 0.623. 

Volume II also includes a number of miscellaneous items. Appendix A repro- 
duces the Act of Congress establishing the Committee. Appendix B is largely a 
diary of the movements of the staff of the Committee. Appendix C is two pages 
on “The Physical Basis of Cloud Modification” by F. A. Berry, Chief Scientific 
Advisor to the Committee. Appendix D is a bibliography of 2259 items. The 
value of this bibliography is lessened by the inclusion of much trivia in such 
media as the Washington Post Parade Magazine or Colliers. As a check on the 
accuracy of this bibliography the reviewer took a sample of one, admittedly 
biased, and looked up his own name in the author index. His name was therein 
as paper number 173, but reference to that number showed another name. The 
reviewer was also unable to find any reference to the important mimeographed 
Report No. 24, “Artificial Nucleation of Cumulus Clouds,” by Roscoe R. 
Braham, Jr., Louis J. Battan, and Horace R. Byers. At the time the Final 

teport was written this experiment was almost unique in being a carefully de- 
signed and executed randomized experiment on cloud modification. Amongst 
the tiny class of completed randomized experiments it was unique in showing 
a statistically significant positive result for the rather special circumstances and 
seeding technique involved. The failure of the Final Report to give an explicit 
account of this work is extraordinary. 

The general lines of the appropriate statistical methodology for testing the 
null hypothesis that seeding is without effect appear pretty clear. If a particular 
site, denoted by the “target,” is seeded at every opportunity, then the estimated 
rain which falls must be compared with an estimate of the rain which would 
have fallen in the absence of seeding. One way of getting such an estimate is to 
take the historical average. This proves unsatisfactory for a number of reasons: 

(a) The rain gauge system used in the past is frequently very inadequate 
for this purpose. Any improved rain gauge system installed to check on the 
effects of seeding will not be comparable with the historical gauge system. 

(b) The variability of rain at one site over time is so great that enormous 
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numbers of observations are needed to give reasonable probability of detecting 
a reasonable change. 

(c) The assumption that the mean rainfall at a given site is unchanged is 
open to question. It seems clear that long and short term changes do take place. 

The design can be greatly improved by using another site as a “control,” 
and making an analysis of covariance using the rainfall on the control, say z, 
as the concomitant variate for the adjustment of the rainfall on the target, y. 
The control must not be so close to the target that it is accidentally seeded 
when the target is seeded. On the other hand, to be of any use in reducing the 
variance of y given x, V[y|x], the control must be close enough to have a good 
correlation with target. 

In a satisfactory experiment, a workable and objective definition of the 
“experimental unit” must be determined. This might be an arbitrary period, 
. for example, a week, or a meteorological unit, for example, the passage of a high 
pressure ridge over a meridian, or a “storm.” 

It is frequently difficult to formulate a completely satisfactory definition of 
unit. An arbitrary definition such as a day or week satisfies the requirement of 
objectivity, but it seems unsatisfactory to shut down the seeding generators in 
the middle of a storm and attribute all rain that fell after that moment to an 
unseeded period. Dummy periods could be inserted routinely between experi- 
mental periods but this will waste time which is precious. With natural meteor- 
ological units, depending upon the criteria for the unit there may be danger of 
subjectivity in deciding when a particular unit has ended. For the experiment 
to be valid either the criteria must be absolutely objective or, if the criteria 
contain some subjectivity, the decision that a particular unit has ended must 
be made by a person in complete ignorance of whether that unit was seeded 
or unseeded. On the one hand, completely objective criteria drawn up before 
the start of the experiment may subsequently encounter unanticipated circum- 
stances which make the criteria look rather silly: on the other hand, ignorance 
of a member of the research team is difficult to maintain. The determination of 
a satisfactory unit is usually by no means easy. 

In any actual experiment a choice must be made of the seeding technique. 
In view of the claims made for ground based silver iodide generators, an experi- 
ment using this equipment in a similar mountainous area would be desirable. 
Another experiment could be to seed at a very high rate directly into clouds 
from one or more airplanes. Conceivably this latter technique, though more ex- 
pensive to operate, might be more effective. 

Assuming that the seeding technique has been selected and the experimental 
unit defined, as each unit occurs a randomized choice is made whether to seed 
or not to seed. The observations will then be y,; for the target and 2; for the 
control, i=1 for the unseeded units and 1=2 for the seeded units; the index 7 
refers to the sequence of the units in each class. A natural statistical analysis 
is to compare 4;, with f2., adjusting for covariance on the z,;. This, of course, 
is equivalent to fitting straight lines to the two classes of data, testing for 
parallelism, and then testing for coincidence of the lines. 

This is essentially the procedure followed by Jeeves, LeCam, Neyman, and 
Scott, reported in 1952 and 1953, and published in 1955 [1]. Thom, in Report 
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No. 1 in Volume II, published in 1957, followed the same procedure in broad 
outline but with some modifications. 

The analysis requires homoscedasticity, i.e., that V[y|z] should be a con- 
stant, and also the analysis is much more convenient if the regression of y on x 
is linear. The California group, Jeeves et al., used a square root transformation 
of both variables. Others have found a logarithmic transformation satisfactory. 
Thom uses a more complicated transformation. He claims that the rainfall at a 
given site has a gamma distribution. He then estimates the parameters of these 
distributions and transforms the actual observations into a unit normal dis- 
tribution. Since all that is required for the covariance analysis is normality and 
homoscedasticity in the residuals about the regression lines, and these appear 
to be closely achieved by the square root or logarithmic transformations, this 
step appears to be unnecessarily elaborate, but not harmful. 

Thom also remarks that, “Since there is a small correlation between succes- 
sive storms but practically none between alternate storms, our recommended 
practice is to fit odd and even storms in a period separately and combine the 
two estimates by averaging.” What is relevant, of course, is correlation between 
the residuals and it is not clear whether this remark is referring to residuals or 
the storms themselves. 

Thom also differs from standard practice in fitting a line to only the unseeded 
set of data and his test statistic is d/-/ Pld], where d; is the mean of the de- 
viations of k seeded points from the values predicted by the estimated historical 
regression line. No explanation is offered for this asymmetric treatment of the 
two classes of points. If, in point of fact, the vast majority of points are in the 
unseeded class, as in the data to which Thom does apply this analysis, then 
there will be little difference between this procedure and the standard one of 
fitting parallel lines to the two classes. On the other hand, if the sizes of the 
two classes are roughly equal there will be some inefficiency in Thom’s analysis. 
Also, if there is nonparallelism of the regression lines, this would indicate that 
seeding has an effect which varies according to the intensity of the storm. If 
the effect is negative for small z and positive for large z, or vice versa, the situa- 
tion could occur in which the effect averages to zero and a consideration of d; 
would not show anything. 

A further defect of Thom’s procedure is that it uses the variance of the in- 
dividual historical points about the historical regression line as an estimate of 
V [y| x] which is then inserted in the formula for V [d,]. It is possible that the 
seeded data has a larger (or smaller) variance than the historical data which is 
unseeded. The standard analysis, of course, includes the estimation of V [y| x| 
separately for seeded and unseeded data and permits a check on the null 
hypothesis of their equality. 

Most data appear to conform well with the assumptions of the standard 
covariance analysis and there does not seem to be much need to go to the 
analogous nonparametric procedures which have been described. 

Parenthetically, a more sophisticated design has been proposed in which two 
areas are observed, and randomly one or the other is seeded, thus giving an 
analog to a cross-over design. At the time of the Committee’s Final Report 
such a design had not been used anywhere, and we will not discuss it further. 
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The foregoing discussion has presupposed a properly randomized experi- 
ment, in which there would be two treatments applied to the target, seeding 
and not-seeding. Unfortunately, at the time the Committee was established 
there had been very few, if any, experiments, in the sense in which a statisti- 
cian would accept the word. The Committee did feel free to sponsor experi- 
mentai work on “what takes place in clouds when silver iodide crystals are re- 
leased at the ground or from aircraft,” in fact they were responsible for what 
they called Project Overseed, Project Sailplane, Project Skyfire, Arizona, 1956, 
Project Skyfire, Montana, 1956, Project Seabreeze, and Project Skyfire, Mon- 
tana, 1957, but they took the position that they could not run or sponsor any 
true experiments on rain making: 

“From the outset the Committee was well aware that within its statutory 
authorization it could undertake a program of only limited scope: 

*“(1) It could not, for lack of sufficient time, carry on long-range research, 
desirable if not essential to evaluation, such as scientifically designed, random- 
ized cloud seeding experiments. It had, therefore, no alternative but to employ 
available ‘historical’ data as the material for evaluation studies.” 

Public Law 256 of the 83d Congress, approved August 13, 1953, required the 
Committee to submit a final report not later than June 30, 1956. This was 
amended to June 30, 1958 on July 9, 1956. It is clear that Congress, in handing 
the Committee the task of making “a complete study and evaluation of public 
and private experiments,” had given them a futile assignment when in fact at 
that time there were virtually no experiments available to evaluate. 

The Committee chose to put a substantial effort into the evaluation of a 
number of commercial projects and the associated historical data (Report No. 2 
of Volume II). These were divided into three groups, one about mountainous 
areas on the west coast, the second about semi-mountainous areas, and the 
third about non-mountainous areas. For the first group they found that the 
seeded data showed an increase of 14 per cent over the historical expectation, 
a result significant at the 0.01 level of significance. The Committee used this 
as the basis of their main conclusion, that seeding in this type of area produced 
an average increase in precipitation of 10 to 15 per cent. 

If this analysis had been based on data arising from properly randomized 
experiments this conclusion would be acceptable, but, since it is based on non- 
randomized data, it needs careful scrutiny. 

Jeeves et al. [1] had remarked that in the Carrizo Plain area storms could 
be divided into several types, and in general the regression lines for the different 
types were not the same. Thus if the proportions of these types in the later 
data were different from the proportions in the earlier data, then there would 
be a difference between the two sets of data. If the later data corresponded to 
seeded data, the change might be attributable to the effect of seeding whereas 
in reality it might be due to this change in proportions of types. 

Furthermore, if the seeder, consciously or subconsciously, chooses to seed 
only storms of those types for which the target shows up favorably compared 
to the control, then he could produce a fictitious positive effect of seeding. 

Thom discusses this possibility and states, “As to the operator’s seeding of a 
nonrandom selection of storm types, it may be stated categorically that he did 
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not.” For the operator to work this device effectively, he would have to guess 
which control area would be used. Clearly each case would have to be examined 
carefully to see whether in fact there were plausible alternative control areas. 
The operator would also have to select storms for seeding, which implies that 
some storms would go unseeded. Thom remarks, “Insofar as we could deter- 
mine, they (the operators) seeded every storm except those occurring in periods 
when they were specifically instructed not to seed by their clients. They were 
quite skilful in seeding all allowable storms for there were very few left un- 
seeded in the storm series.” This is clear enough, but Neyman, Scott, and 
Vasilevskis [2] reviewed commercial operations by nine operators in California 
in 1951-2 and found that for those days under contract in which more than 
0.10 inches of rain fell, 41 per cent were not seeded. These figures and Thom’s 
assertion appear inconsistent. 

The potential importance of this phenomenon of the seeder choosing only to 
seed a fraction of the seeding opportunities is illustrated by two scatter dia- 
grams of Neyman et al. [2]. In an experiment in 1957-59 in Santa Barbara, a 
commercial operator was to report occasions suitable for seeding to a “random- 
izer” who would then tell him whether or not to actually seed the target. It 
turned out that the seeder reported as seeding opportunities 51 per cent of those 
12-hour units of observation in which more than 0.10 inches of rain fell on the 
target. Those occasions he diagnosed as suitable for seeding gave a relatively 
high precipitation on the target compared with the control, whereas those occa- 
sions which he failed to diagnose as suitable for seeding gave a relatively low 
precipitation on the target compared with the control. Selectivity of this na- 
ture would give a fictitious positive effect for seeding. In the present experi- 
ment we are able to observe it, since half the occasions declared suitable for 
seeding were not seeded, through the chance effect of the randomization pro- 
cedure. Since half the storms suitable for seeding were in fact not seeded, we 
can observe the effect of selection without the picture being confused by the 
presence or absence of seeding, and we observe that the effect of selectivity is 
to favor the target. 

To belabor this point, since it is crucial to the entire question, we have two 
groups of storms, those declared suitable for seeding and those declared un- 
suitable. In the present experiment half of the former group were left unseeded, 
and so can be compared directly with the latter group: they are different, favor- 
ing the target. In ordinary seeding operations, on the other hand, all the former 
group would be seeded. They would be compared with the latter group, and 
found to be different, the target having relatively large amounts of rain. This 
increase would be attributed to the seeding which in ordinary seeding opera- 
tions would have been performed. From the randomized experiment, we see 
that this increase is completely attributable to the selectivity. 

It is clear that the Committee’s analysis of the historical data leaves un- 
settled the basic question of whether seeding of the type used in the commercial 
operations under analysis has any effect on precipitation. The Committee’s 
main conclusion is thus unjustified. It is surprising that they were prepared to 
make this conclusion, for its invalidity was clearly set out in 1955 in [1], and 
furthermore after the Committee stated this conclusion in their First Interim 
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Report on February 8, 1956, they were strongly criticized by several speakers 
at the Conference on the Scientific Basis of Weather Modification Studies at 
the University of Tucson in April 1956. 

The Committee was hardly a shoe-string operation. For fiscal 1955, 1956, 
and 1957, $120,000, $275,000, and $300,000 were appropriated. For fiscal 1958 
the Senate Appropriations Committee recommended $256,000, but the House 
denied further funds. On July 1, 1957 the Committee was without funds, but 
on August 28, $100,000 was available for use before the end of the calendar year 
for preparing and publishing the Final Report. 

It is frustrating to contemplate the possibility that the above money, 
$795,000, if devoted to one or two deliberate, well-planned and executed experi- 
ments, might have settled the question. As it is, the emotions generated by this 
topic make the carrying out of a satisfactory experiment a Herculean task. 
Apart from the considerable scientific and logistic problems, scrupulous atten- 
tion to the possibility of bias, conscious or subconscious, and fraud is necessary. 
The question of whether it is necessary to station guards at the gauges to pre- 
vent “salting” by interested parties needs serious examination. The Final Re- 
port of the Advisory Committee on Weather Control has settled none of the 
important issues, and its failure points up the need for a truly objective deter- 
mination of these questions. 
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THE USE OF STATISTICS IN THE FORMULATION AND 
EVALUATION OF SOCIAL PROGRAMMES 


Octavio CABELLO 
Statistical Office of the United Nations* 


Social statistics are defined as the statistics required for social pro- 
grammes, and a social programme as the scheme of public and private 
activities which have a direct bearing on a particular aspect of living 
conditions. It is assumed that there is a separate programme for each 
component of the levels of living, e.g. health, education, housing. 

The uses of statistics for the following aspects of social programming 
are examined and illustrated with examples from the field of housing: 
estimating of economic resources for the programme; measurement of 
living conditions by statistical indicators; estimating of social needs, 
e.g. “dwellings,” to maintain or improve the levels of living of growing 
populations; testing of the consistency of social needs and economic, 
human and material resources; and adoption of goals for social pro- 
grammes consistent with expected resources. 

Several forms of evaluation are briefly discussed, and the indicators 
of levels of living are considered to be the only comprehensive yard- 
sticks of the success or failure of social programmes. 

The purpose of the paper is to provoke dissemination of information 
on this subject among under-developed countries and to stimulate 
research on the uses of statistics for social programming. 


1, INTRODUCTION 


HE role of statistics in guiding decisions regarding micro aspects of social 
f potestobn seems to be well established. But, the possibility of using sta- 
tistics in formulating social programmes becomes uncertain as the scope of 
such aspects increases, and the question of whether and how statistics can be 
used in formulating social programmes at the national level has no obvious 
answer. As a result, while it may be assumed that minor decisions in the social 
field can be, and frequently are, based on scientific information, the major 
decisions regarding social policy and programmes—especially in under-de- 
veloped countries—are mainly a matter of personal judgement based primarily 
on political expediency and traditional grounds. 

The purpose of this paper is to explore the possibility of using social statistics 
in formulating national social programmes, to provoke wider dissemination of 
information on this subject in the under-developed countries and to stimulate 
further research on the methodology of formulating national social programmes. 

The discussion here has been restricted in some cases to the field of housing, 
but it is hoped that similar applications may be found in other social fields. 


2. SOCIAL STATISTICS AND SOCIAL PROGRAMMES 


The first problem that confronts anyone discussing social statistics is to 
decide what meaning to attach to the concept. In current usage the scope of 
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social statistics ranges from a narrow coverage which includes only one aspect 
of social conditions, such as crime or the volume of work of welfare institutions, 
to the broadest possible range of subjects related to the description of the social 
structure, social institutions, human relations, environmental conditions, and 
government and private activities affecting living conditions. It seems that in 
spite of the attempts that have been made, social statistics cannot be identified 
as a unitary field by reference to common units of observation, or methods of 
collecting and analysing data. However, “social statistics” undefined have been 
the subject of continued attention in recent years, and this may be sufficient 
justification for preserving the expression (social statistics), defining in each 
case the meaning attached to it. 

For the purpose of this discussion social statistics are defined as the statistics 
required for the formulation and evaluation of broad national social pro- 
grammes. This definition focuses attention on the use or purpose of the statistics 
rather than on the nature of the units of observation or on the methods of col- 
lecting and analysing the data. Though non-rigorous, this definition is useful 
since it stresses the principle that statistics, and government statistics in par- 
ticular, must always have a well defined purpose. 

Little would be gained, however, by defining social statistics as the statistics 
required for social programmes unless social programmes can be defined inde- 
pendently of social statistics. A social programme may be considered to be 
formed by the scheme of public and private activities which would contribute 
to maintain or improve some particular aspect of the living conditions. The 
aim of such a programme would be to raise the levels of living at the maximum 
practicable rate commensurate with long-term economic development and 
human resources, and in accordance with nationally and internationally ac- 
cepted principles regarding human rights and responsibilities. 

It may be recalled that, in 1953, a United Nations Committee of Experts on 
International Definition and Measurement of Standards and Levels of Living 
recommended the use of twelve separate components of the levels of living,! 
each component representing a well delimited aspect of the levels of living 
(i.e. of the actual living conditions). In view of this recommendation and of the 
fact that aims of social policy are usually stated with reference to particular 
fields, such as education, health, housing and nutrition, it will be assumed, for 
the purpose of this discussion, that a separate social programme will be formu- 
lated for each component of the levels of living. Such an assumption should be 
acceptable since the techniques and factors involved in improving living condi- 
tions in each field are different and separate authorities or government agencies 
are responsible for the conduct of the various programmes. 

The activities involved in the various programmes are of a different nature 
and, therefore, the methods of formulating the programmes are also quite 
different. Housing programmes, for example, are concerned with maintaining 
an adequate supply and distribution of dwellings; education programmes with 
maintaining an adequate supply and distribution of teaching services and 
facilities; nutrition programmes with premoting adequate food consumption. 





1 United Nations, Report on International Definition and Measurement of Standards and Levels of Living, Docue 
ment No. E/CN.3/179-E/CN.5299, New York: United Nations, Sales No. 1954.IV.5. 
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Health programmes are conceptually more complex since the levels of health? 
depend upon a great many factors including housing, education, nutrition and 
so forth. Some social programmes are concerned with the supply and distri- 
bution of certain essential goods or services; others with the prevalence of 
certain conditions such as social security or human freedoms. Statistics may be 
useful in connexion with every type of programme but the uses will differ 
according to the characteristics of each programme. 

Although it is expedient to consider each social programme separately, it 
must be recognized that they tend to be highly interrelated in the sense that the 
execution of one programme has direct implications for the living conditions in 
other social fields, e.g. malaria eradication, food supply and labour force; im- 
provement of levels of education and housing conditions and levels of health; 
sanitation and nutrition, etc. Furthermore, there is an evident inter-action be- 
tween social programmes and economic development. However, there is no 
mechanism for integrating social programmes in terms of activities and units 
of “well-being,” such as is found in the economic area in the form of a system 
of national accounts, input-output tables and mathematical models, which 
aggregate and relate economic actions in terms of monetary units, and it is, 
therefore, necessary to deal with each social programme separately. 


3. USES OF STATISTICS IN FORMULATING A SOCIAL PROGRAMME 


The task of formulating a national social programme does not consist in 
solving a system of equations. Mathematical models describing the effect of 
social measures on the levels of living, the cost of such measures and the physi- 
cal inputs involved have not been established. Such models would be useful, 
even if based solely on non-tested hypotheses and informal estimates, since 
they would set forth in an explicit form the “laws” or the assumptions involved 
in formulating the programmes. Unfortunately, complex and complete hypoth- 
eses, models, or laws describing the interaction of demographic, social and 
economic factors seldom exist, and the formulation of national social pro- 
grammes can only be based on simple hypotheses, hardly more dignified than 
crude hunches. However, carefully selected statistics can materially enlighten 
the bases for the decisions by providing the factual information required to en- 
sure compatibility between social needs and resources, and the statistics record- 
ing the experience of previous years will provide perhaps the most valuable 
guide to planners. 

From a practical point of view, estimates of the economic resources, i.e. 
expenditure and investment by the public and private sectors, provide a suitable 
point of departure for the formulation of short term programmes in a particular 
social field. As a next step, the volume of public and private activities needed in 
order to maintain or improve the levels of living might be estimated and the 
total value calculated by using unit costs. The cost of the national programme 
could then be compared with the resources. If the resources appear to be in- 
sufficient to maintain or moderately raise the levels of living of a particular 





2 Health is defined in the constitution of the World Health Organization as follows: “Health is a state of com- 
plete physical, mental and social well-being and not merely the absence of disease or infirmity.” See: United Nations, 
Final Acts of the International Health Conference, held in New York from 19 June to 22 July 1946, Document No. 
E/155, New York: United Nations, October 1946. 
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component, a revision of the resource allocation would need to be considered, 
together with the possibility of improving the efficiency and productivity of 
the agencies and industries involved by introducing new methods or procedures. 
After all possibilities are explored—and statistics would be essential for such 
an examination—a social programme could be outlined which would indicate 
the volume of activities that would be likely to be performed and the probable 
consequences of the programme upon the levels of living. 

It is true that there are other limiting factors of social programmes, besides 
the economic resources, which must also be duly considered, but the system of 
economic statistics provides a partial linkage of great operational significance 
between economic development and social programmes. In effect, for practical 
purposes and considering only short term programmes, capitalist and socialist 
countries alike consider social development as a by-product of economic de- 
velopment, and resources for social programmes are estimated or allocated on 
the assumption that adequate provision is made for ensuring—as a matter of 
first priority—an adequate rate of economic development? and a proper balance 
between consumption and production, in order to avoid inflationary pressures. 

On the assumption that the process broadly outlined above would be fol- 
lowed in formulating a social programme, statistics would be required for the 
following major aspects of the programming operation: 


a. To estimate the economic resources expected to be available for the social 
programme, 

b. To measure living conditions by means of indicators of the levels of living, 
To estimate the social needs, i.e. the supply and distribution of “social 
units,” required for maintaining or improving the levels of living, 

. To test the compatibility of the estimated social needs with the expected 
economic and human resources, 
To select the most convenient combination of activities, i.e. a social pro- 
gramme. 


Uses of statistics to estimate economic resources expected to be available for 
the social programme 3.1 


Considerable attention has been given to the application of statistics in con- 
nexion with problems of economic planning and development. Principal sources 
of information regarding expenditure and investment in previous years are the 
national accounts‘ and the government accounts® when these systems provide 
separate data for the several social programmes. It is not, however, an easy task 
to separate out the data for various categories and eventually obtain separate 
totals for the country as a whole. For example, the apportionment of certain 
items of public expenditure among related social fields involves problems of 
interpretation, such as the allocation of school feeding programmes to health 
or to education; the allocation of urban water supply projects to health, hous- 





3 In capitalist countries by means of an adequate rate of investment in the means of production. In socialist 
economies by securing a rate of increase of the output of means of production greater than the rate of increase of the 
output of consumers’ goods. 

4 United Nations, A System of National Accounts and Supporting Tables, Document No. ST/STAT/Series 
F./No.2/Rev.1, New York: United Nations, Sales No. 1959.X VII.11. 

5 United Nations, A Manual for Economic and Functional Classification of Government Transactions, Document 
No. ST/TAA/M/12—ST/ECA/49, New York: United Nations, Sales No. 58.X VI.2. 
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ing, or other programmes; the allocation of the cost of marsh clearance to 
health or to agriculture, and so forth. Some of the statistical data commonly 
used in evaluating the economic resources devoted to housing programmes, are 
the following: 


a. Expenditure for dwelling construction. This item, from the National Ac- 
counts includes all expenditure on new dwelling construction and major 
alterations to residential buildings including the value of the change in 
work, in progress, the cost of painting and all permanent fixtures, but ex- 
cluding the value of the land before improvement, 

. Rent and water charges. This item of private consumption expenditure 
includes all gross rent (actual and imputed gross rent on owner-occupied 
houses as well as actual and imputed grounds rents payable) including 
water charges and local rates. 


These items provide information about the order of magnitude of the eco- 
nomic resources devoted to housing in the past. When related to the national 
income and other national aggregates they provide ratios which are useful in 
estimating future resources, on the assumption that past patterns of expendi- 
ture and allocation would prevail or change in a foreseeable direction. The 
following ratios may be used for this purpose: 


a. Expenditure on dwelling construction as a percentage of gross domestic 
product, 
b. Expenditure on dwelling construction as a percentage of gross fixed capital 


formation, 
c. Rent as a percentage of private consumption expenditure. 


Estimates of the gross domestic product, private consumption expenditure 
and fixed capital formation, over the period to be covered by the programme, 
will usually be obtained by the agencies forecasting economic development. On 
the basis of these estimates it would be possible, by applying assumed ratios 
based on past experience and other factors, to derive estimates of the order of 
magnitude of the national resources that are likely to be devoted to housing in 
the next few years. Such estimates, would, of course, need to be revised every 
year, in the light of new information. This procedure is a very crude one but as 
crude as it is it may be considered too elaborate for many underdeveloped 
countries. 

As an additional guide in estimating resources it may be useful to compare 
the situation or patterns of expenditure in various countries. Table 1 shows the 
values of the three ratios described above for 32 countries, in 1957 or 1956. 
The variation among countries is considerable and apparently there is no rela- 
tion between the ratios and the state or rate of development of the countries. 
Also, the ratios in the countries with long historical data available show long 
term and cyclical variations as well as‘irregular changes. International com- 
parisons, therefore, show only the range of variation of these ratios but would 








* The evaluation of the economic significance of housing programmes is discussed in United Nations, Administra- 
tive Committee on Co-ordination, Statistical Indicators of Housing Levels of Living, a methodological study of a 
preliminary nature, Document No. ACC/WPSSP/1/4/Add.6, New York: United Nations, 6 August 1959. 





EVALUATION OF SOCIAL PROGRAMMES 459 


be rather unhelpful for the purposes of selecting suitable rates of expenditure 
investment in housing. But long national series would be extremely useful guides 
in determining the resources that are likely to become available, over a period 
of a few years, for housing as well as for other social programmes. 

An important factor in connexion with estimating the economic resources 
available for social programmes is the distribution of resources between the 
public and private sectors. In some countries it has been found to be more effi- 
cient to finance important aspects of social programmes through government 


TABLE 1. EXPENDITURE FOR DWELLING CONSTRUCTION AS A PER- 
CENTAGE OF GROSS DOMESTIC PRODUCTS AND GROSS FIXED 
CAPITAL FORMATION, AND RENT AS A PERCENTAGE OF 
PRIVATE CONSUMPTION EXPENDITURE.* SELECTED 
COUNTRIES. 1956 OR 1957 
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Austria 1957 
Belgium 1957 
Canada 1957 
Ceylon 1957 
Denmark 1957 
Dominican Republic 1957 
Ecuador 1956 
Finland 1957 
France 1957 
Ghana 1957 
Greece 1957 
Ireland 1957 
Israel 1956 
Italy 1957 
Japan 1957 
Korea, Rep. of 1957 
Luxembourg 1957 
Mauritius 1957 
Morocco 1956 
Netherlands 1957 
New Zealand 1957 
Nigeria, Fed. of 1956 
Norway 1957 
Panama 1956 
Peru 1956 
Philippines 1957 
Portugal 1957 
Puerto Rico 1957 
Sweden 1957 
United Kingdom 1957 
United States 1957 
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* At current market prices. 
Source: United Nations, Administrative Committee on Co-ordination, Statistical Indicators of Housing Levele 
of Living, Document No. ACC/WPSSP/I/4/Add.6, New York: United Nations, 6 August 1959. 








460 AMERICAN STATISTICAL ASSOCIATION JOURNAL, SEPTEMBER 1960 


agencies, while in others the direct purchasing of essential goods and services 
by the individual households is considered possible and satisfactory. The gen- 
eral situation may be considered to be a mixed arrangement by which pro- 
grammes are financed through both the public and private sectors. It is im- 
portant, therefore, to have information about the extent to which the various 
programmes are financed by each of the two sectors. This question was dis- 
cussed in a paper’ presented to the fifth session of the ECAFE Working Party 
on Economic Development and Planning, held in Bangkok, in September 1959. 
Table 2, which was taken from this paper, illustrates how data on resources 


TABLE 2. PROVISION AND FINANCING OF EDUCATIONAL SERVICES 
(IN MILLIONS OF NATIONAL CURRENCY) 








” Public sector Private sector Total 
Financed by: : 





Provincial Expenditure 
and local | House-| Enter- |for providing 
Provided by: Mie govern- | govern- holds | prises | educational 
ments services 


a Central 





Central government 
Public ————— 
sector Provincial and local 
governments 








Private 
sector 


Non-profit institutions 





Enterprises 


Total Expenditure for financ- 
ing educational services 























a =likely to be significant 
b =moderate significance »Notes applicable to ECAFE countries. 
— =negligible 
Source: United Nations, Economic Commission for Asia and the Far East, Working Party on Economic De- 
velopment and Planning, Planning the Pattern of Public Social Expenditures, Document No. E/CN.11/DPWP.5/L.7, 
New York: United Nations, 4 August 1959. 


would need to be broken down by sectors for educational programmes. A sim- 
ilar table could be prepared for other programmes. 

A further breakdown of data on resources according to investment or current 
expenditure would also be of interest. The Manual for Economic and Functional 
Classification of Government Transactions® contains a full discussion of this 
aspect. 


Use of statistics to measure living conditions by means of indicators of the 
levels of living 3.2 


The purpose of social programmes is to raise the levels of living, and, in order 
to translate this purpose into positive action, it is necessary to express levels 





7 United Nations, Economic Commission for Asia and the Far East, Working Party on Economic Development 
and Planning, Planning the Pattern of Public Social Expenditures, Document No. E/CN.11/—-DPWP.5/L.7, New 
York: United Nations, 4 August 1959. 

* Op. cit. 
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of living in statistical terms. The United Nations Committee of Experts, men- 
tioned above, proposed in 1953, that the levels of living should be measured by 
reference to twelve components by means of a series of statistical indicators 
for each component. Recently, in 1959, an inter-agency Working Party on 
Statistics for Social Programmes® reviewed the components and indicators in 
the light of information that had become available since 1953, reduced the 
twelve components to nine, and proposed the following priority indicators: 


Component Health: 


1. Expectation of life at birth, 
2. Infant mortality rate, 
3. Crude annual death rate. 


Component Food Consumption and Nutrition: 


. National average food supplied, in terms of calories at the retail level, 
compared with estimated calories requirement, 
. National average food supplied, in terms of total proteins at the retail level, 
3. National average food supplied, in terms of animal protein at the retail 
level, 
. Per cent of total calories derived from cereals, roots, tubers and sugars. 


Component Education: 


1. Adult literacy rate, 

2. Total school enrolment ratio, 

3. Higher education enrolment ratio. 
Component Employment and Labour Conditions: 


1. Proportion of persons employed in the total labour force, 
2. Ratio of male labour force in agriculture to total labour force, 
3. Relative real wages in selected occupations. 
Component Housing: 
1. Per cent of the population living in “dwellings”, 
2. Per cent of occupied dwellings with three or more persons per room (over- 
crowding), 
3. Per cent of occupied urban dwellings with piped water. 
Component Social Security: 


[No indicators proposed as yet. | 


Component Clothing: 


[No indicators proposed as yet. | 


Component Recreation: 


[No indicators proposed as yet. } 





* United Nations, Administrative Committee on Co-ordination, First Technical Report of the Working Party 
on Statistics for Social Programmes (Geneva, 16-22 September 1959), Document No. ACC/WPSSP/1/9/Rev.1, 
New York: United Nations, 22 October 1959. 
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Component Human Freedoms: 


[No indicators proposed as yet. | 


The indicators are of various classes; some reflect achievement or ultimate 
aims of social programmes in very broad terms, as in the case of the expectation 
of life at birth; others are more directly related to the activities in the respective 
social programme, as in the case of the total school enrolment ratio. The indi- 
cators of housing levels of living belong to the first class. They are operational 
indicators which define aims of social programmes and they can also be utilized 
for estimating the order of magnitude of the programme needed in order to 
improve the living conditions in respect of housing. Some of the other indicators 
are not so useful for formulating programmes because, though reflecting cer- 
tain aspects of living conditions, their relationship to the possible actions in a 
particular social field are not well established. This seems to be the case with 
respect to the indicators of health which are not directly related to actions in 
the medical fields. For example, at present it appears impossible to determine 
what type of medical or health activities would be necessary in order to increase 
the expectation of life at birth by one year or reduce the infant mortality rate 
by five units, or the crude death rate by one unit. On the other hand, the rela- 
tionship between indicators of housing or education conditions and actions in 
these fields are understood and can be utilized in formulating relevant pro- 
grammes. There is much to be done in developing methods of measuring levels 
of living and particularly in developing “operational” indicators, i.e. those 
which can be used in formulating national or local social programmes. 


Use of statistics to estimate the social needs, 1.e. the supply and distribution 
of “social units” required for maintaining or improving the levels of living 3.3 


The indicators discussed above reflect the aims of social programmes by de- 
fining in statistical terms commonly accepted aspirations in each field. The 
next question that may have to be answered by the policy maker or program- 
ing officer is: what is the magnitude of the job to be done, in terms of dwellings 
to be built, food to be supplied and distributed, medical and educational 
services to be made available, by the public and private sectors, in order to 
maintain or improve prevailing living conditions? The methodological prob- 
lems involved in estimating social needs have been explored but not settled. 
It must be mentioned in this connexion that the Twentieth Century Fund 
Surveys,'® which include estimates for the United States for 1960, contain an 
impressive amount of information on this subject. 

Social needs must be related either to a point of time (e.g. a certain date), 
as in the case of estimates of the housing shortage, or to a certain time period, 
as in the case of food supplies. The volume of work to be done, i.e. the social 
programmes, however, can only be related to a period of time which may be 
one year, three, five or other number of years. A housing programme, for 
example, may be designed to absorb the existing housing shortage during a 5-, 
10-, or 20-year period and, consequently, the magnitude of the annual job to 





10 Dewhurst, Frederic J. and Associates, America’s Needs and Resources, A New Survey, New York: The Twen- 
tieth Century Fund, 1955. 
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be done will change in accordance with the length of the programme. The fol- 
lowing discussion is concerned with estimates of needs in terms of goods or 
services to be provided by the programme over a period of several years. 

The possibility of making objective estimates of social needs may be il- 
lustrated with reference to the field of housing. In this case the question is: 
how many dwellings and housing units of other classes!! would need to be built 
in a country or major geographic division, during a given period, in order to 
ensure that the percentage of people living in dwellings remains stable, or in- 
creases, and that both the density of occupation (persons per room) and the 
percentage of dwellings with three or more persons per room remain stable or 
decrease? Questions of this type have been answered for many countries at vari- 
ous periods; among others, for Australia, Belgium, Chile, Colombia, Denmark, 
France, Japan, Netherlands, Peru, Sweden, United Kingdom, United States, 
and Venezuela, and a review of the assumptions and methods used is being 
prepared in the United Nations Statistical Office. In making such estimates, 
statistics have been used in connexion with the following aspects of the problem: 


a. To estimate the number of dwellings that would be needed to house the 
population currently living in housing units considered unacceptable from 
a structural ‘point of view, e.g. huts, caves, “viviendas callampas”, etc., 
or who are without shelter of any kind, 
. To estimate the number of dwellings required to reduce the density of 
occupation, 
. To estimate the number of dwellings required to eliminate undesirable 


and involuntary doubling up of families and households, 

. To estimate the number of regular dwellings that need to be repaired or 
replaced because of their state of maintenance, 

. To estimate the number of dwellings that will need to be replaced because 
of obsolescence, disaster or conversion to non-residential use, etc. 
To estimate the number of dwellings required to house the future popula- 
tion increase, and the potential new households, over a period. 


Estimates of total needs have been obtained by combining the partial esti- 
mates indicated above or by other means. 

In the case of housing programmes the essential “goods” is the dwelling, and 
in this discussion only one type of dwelling has been considered. Other pro- 
grammes are concerned not with one type of goods or services but with the 
supply and distribution of several or many goods and services, separately or in 
alternative combinations. In order to estimate social needs it would be neces- 
sary to define which are the essential units in each case. For primary education 
programmes the essential unit may be the teacher, for health programmes there 
would be several units and the doctor might be one of them. The determination 
of which are the basic social units to be supplied or made available to the 
population in the case of each programme is an essential question which must 
be settled in order to estimate the volume or number of units needed and 
whether such needs could be satisfied with the available resources. 





11 The concepts of “dwelling” and “housing unit” used in this discussion are those defined in United Nations, 
General Principles for a Housing Census, Document No. ST/STAT/SER.M /28, New York: United Nations, Sales 
No. 58.XVIL8. 
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Use of statistics to test the compatibility of the estimated social needs with the 
expected economic and human resources 3.4 


It would be of little use to estimate the volume of activities necessary to 
maintain or raise the levels of living without verifying whether such a pro- 
gramme is feasible from the point of view of resources available. Such verifica- 
tion would need to be conducted in respect of financial resources; human 
resources (such as doctors, engineers or labourers); expected availability of 
material elements (such as drugs, vaccines, books, foods or building materials) ; 
and the capacity of the agencies and industries concerned to undertake their 
share of the programme. The compatibility of social needs and resources is 
tested by comparing the estimated needs with the corresponding resources. 
Information on the following three aspects would be necessary, with respect to 
each programme, in order to conduct such tests: 


a. What are the goods and services, i.e. the “social units” which need to be 
supplied to the population in order to maintain or raise the levels of living, 

b. How many “social units” are needed to produce a certain effect on the 
statistical indicators of the levels of living, 

c. What is the average cost of the “social units” involved in each programme. 


Information on these three aspects can be obtained in the case of housing 
programmes but it may be more difficult to obtain in the case of other pro- 
grammes. The “social units” would change from country to country according 
to circumstances, social organization and technical knowledge prevailing at the 


time the programme is formulated. The problem involved in obtaining the in- 
formation described in (b) is complicated by the fact that the same changes on 
certain indicators of the levels of living (e.g. crude death rate) may be obtained 
by the effect of different “social units”, or combinations of them, and that in the 
social field it is not normally possible to isolate the results of specific actions. 
However, this complication is essentially peculiar to health programmes whose 
objectives are vaguely defined by the indicators of the levels of health (see 
part 3.2). The solution might be facilitated by narrowing down the field of 
health to aspects directly and primarily related to medical or sanitary actions, 
or by conceptually sub-dividing the health component of the levels of living 
into “sub-components”. 

The testing of the compatibility of social needs and physical resources implies 
a knowledge of the physical inputs of various kinds that constitute a certain 
“social unit”. In the field of housing for example, it would be necessary to know 
how much of each class of basic materials, how many hours or months of work 
of non-skilled, skilled and professional manpower, etc., go into the construction 
of an average dwelling of a certain type. These inputs when multiplied by the 
number of dwellings needed to obtain the desired effect on the housing levels 
of living would yield estimates of physical needs for housing programmes 
which could then be compared with the available resources. 

The capacity to produce or import certain essential elements (for example 
building materials in the case of housing programmes, vaccines and drugs in the 
ease of health programmes, books in the case of education programmes, basic 
foods, etc.), as well as the availability of professional and skilled personnel, 


‘ 
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are essential limiting factors for social programmes, just as essential as the 
financial resources. The case of countries which could finance an adequate social 
programme but do not have enough teachers, doctors, engineers, nurses, or 
statisticians is not unusual. A much more frequent case is, of course, that in 
which both financial and physical resources are scarce. 

The availability of professional personnel has especial importance for social 
programmes, as can be illustrated by the following example taken from “The 
Approach of Operational Research to Planning in India” by P. C. Mahalanobis: 
“There are, at present, about 65,000 fully qualified (six-year trained) doctors 
in India 
India every year, and the cost of training each doctor is about forty or fifty 
thousand rupees. Under existing conditions it may take 60 or 70 years to pro- 
vide one doctor for every two thousand persons in the rural areas on an aver- 
age ...I1 have thought it desirable, therefore, to include in the Draft Plan- 
frame a proposal to bring some health service to every home in the country 
within a reasonable time, possibly in 10 or 15 years... , by establishing two 
new cadres of 2-year and 4-year trained health assistants as a first step to a 
national health service throughout the country. . . . One 6-year trained physi- 
cian would be in charge of a group of 5 or 6 health assistants; ... ”!? 

The testing of needs and economic resources in monetary terms can be made 
with respect to the total resources of the country or with respect to the house- 
holds’ income. The use of statistics for testing the consistency of national re- 
sources with the magnitude of social programmes is of great importance, since 
social programmes must be realistic but sufficiently adequate to ensure a 
gradual improvement of living conditions. The total economic resources needed 
for a social programme would be determined by relating to the number of 
“social units” needed their unitary cost. The aggregate cost of a national pro- 
gramme may then be compared with the resources estimated to be available 
for that programme (see part 3.1). 

The following example taken from the field of housing will serve as an illus- 
ration: In a report on housing in Peru’ it is estimated that it would be neces- 
sary to construct 2,339,700 dwellings and to repair some 1,011,500 over a 30- 
year period, beginning on 31 December 1956, in order to improve the housing 
conditions to certain levels now considered acceptable in that country. This 
would represent an investment of approximately 60,000 million soles at 1956 
prices, as indicated in Table 3. 

The total cost of the programme is related to the national income as follows: 
“The cost of the housing programme, excluding communal services, has been 
estimated to be 60,000 million soles, to be invested in 30 years. Since the na- 
tional income and, parallel with it, the capacity for dwelling construction will 
increase during this period, the most convenient form of distributing such in- 
vestment over the 30-year period is by equating it to a fixed proportion of the 
national income. 





12 Mahalanobis, P. C., “The Approach of Operational Research to Planning in India,” Reprinted from Sankhyd, 
The Indian Journal of Statistics, Vol. 16, Parts 1 anc 2, Chapter 5, para. 7, Calcutta: Statistical Publishing Society, 
1955. 

18 Comisién para la Reforma Agraria y la Vivienda, Informe sobre la vivienda en el Peré, Lima, Peri: Imprenta 
Casa Nacional de Moneda, 1958. 
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TABLE 3. COST OF 30-YEAR HOUSING PROGRAMME, 
BEGINNING 31 DECEMBER 1956 





New Aaidiade to be constructed | is E sin iiilinis to be repaired 


























| 
5 Cost per | Total in- | Cost per | Total cost 
— Number of | dwelling | vestment | Number of| dwelling | of repairs 
dwellings (Soles (Soles dwellings (Soles (Soles 
1956) 1956) 1956) 1956) 
(millions) (millions) 
Metropolitan | 657 , 200 40 ,000 26 ,288 68 , 100 10 ,000 681.0 
Urban | 669 , 400 30 ,000 20 ,O82 255 ,800 7,500 1,918.5 
Rural | 1,013,100 10 ,000 10,131 687 ,600 2,000 1,375.2 
Total |, 2,339,700 56,501 1,0 ,011,500 3,947.7 








Source: Comisi6n para la Reforma Agraria y la Vivienda, Informe sobre la vivienda en el Peri, Lima, Perd: Im- 
prenta Casa Nacional de Moneda, 1958. 


“On this basis and on the assumption that both the population and the 
national income will increase following a geometric progression, with annual 
rates of 2.1 and 5 per cent, respectively, the annual investment can be esti- 
mated. 

“The investment for the first year would be 900 million soles, and this 
amount would be increased by 5 per cent per year, that is, in the same propor- 
tion that the national income increases in constant prices. In this form 60,000 
million soles would be invested over the 30-year period. 

“Considering that the national income in 1957 was of the order of 25,000 
million soles .. . and that the available proportion for net investment was 16 
per cent (average for the last five years), the resources for investment would be 
4,000 million soles. The 900 million investment needed for the first year of the 
housing programme represent approximately 3.6 per cent of the national in- 
come and 22.5 per cent of the net investment, which are evidently very high 
percentages.” !4 

The use of statistics to determine the extent to which households are able 
to finance their essential social needs out of their regular income is quite com- 
mon. The question to be answered may be proposed in the following form: 
Given a certain average level of income, a distribution of households by income 
groups and corresponding patterns of expenditure, what proportion of the 
households could, or could not, afford to purchase the “social units” considered 
in social programmes (housing, food, education, medical services, etc.) at cur- 
rent or expected prices? For example, the question of whether households can 
afford a type of minimum satisfactory dwelling is normally explored in the 
formulation of national housing programmes. 

In a fairly recent study on the economic significance of housing in Central 
America and Panama, it is estimated that approximately 46 per cent of the 
urban families in El Salvador could not pay more than 70-88 per cent of a 
monthly rent equivalent to 1/100 of the average cost of a dwelling, such rent 





4 Jbid., pp. 57-8 (Unofficial translation). 
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being considered adequate to finance the dwelling construction, (“economic 
rent”). In another study dealing with the housing situation in Colombia, 
about 20 per cent of urban families were found unable to pay a rent equivalent 
to the amount needed to finance the construction of a minimum acceptable 
dwelling in forty years.'® 

The form in which the information on household income and expenditure 
may be used in the formulation of social programmes deserves separate con- 
sideration, as, indeed, do all the other major uses of statistics briefly examined 
here. It is not possible to go any further into this matter in this discussion. 


Use of statistics to select the most convenient combination of activities, t.e. a 
social programme 3.5 


After social needs are estimated and matched with the estimates of available 
resources, the next logical step would be to attempt to bring needs and re- 
sources into balance by adopting feasible targets or goals for social programmes. 
Such targets could not be entirely objective since they would be based on as- 
sumptions regarding future population growth,'” national income and relation- 
ships between social measures and levels of living, and would be the result of 
compromise between desire for improvement of one aspect of the levels of living 
(e.g. housing), the need to maintain an even rate of improvement in other 
components and the need to maintain a sustained economic growth. The targets, 
furthermore, should be revised periodically as new information becomes avail- 
able. The statistical bases on which the targets would be established would be 
known and could be examined by anybody interested in the subject, a fact 
that should have considerable political significance, especially in under-devel- 
oped countries. 

The resources may be found to be smaller or larger than the needs fora 
certain social programme. Normally the balance of needs and resources for any 
programme will vary depending on the type of resources. There may be suf- 
ficient labour and insufficient building materials, sufficient financial resources 
and insufficient professional personnel, and so forth. Whatever the situation 
may be, the use of statistics in exploring the possibilities of equating needs 
and resources would be essential. 

If after exploring all the possibilities, the resources are insufficient even to 
maintain the existing levels of living, statistics can also be used to evaluate the 
consequences of such a situation. In the case of housing, to mention a very real 
example, statistics could be used to estimate the extent which the housing 
shortage would increase in the event that an insufficient number of dwellings 
were built over a period of years. Statistics, however, would not be very helpful 
in judging the political and social consequences of allowing housing conditions 
to deteriorate consistently over a period of several years. 





4% United Nations, Economic Commission for Latin America, Significacién econémica de los programas de vi- 
vienda en Centro América y Panamé, Document No. E/CN.12/CCE/AC.6/5, New York: United Nations, 1 No- 
vember 1957. 

18 Corporaci6n Nacional de Servicios Pablicos, Departamento de Vivienda, Déficit y demanda de vivienda en 
Colombia, Series Estudios Socio-Econémicos No. 3, Bogot&é, Colombia, 1956. 

17 Implied in projections of national income as well as in the estimates of future needs. 
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4. USE OF STATISTICS FOR THE EVALUATION OF SOCIAL PROGRAMMES 


While the formulation of a social programme is an a priori operation, oriented 
to the future, the evaluation must be made a posteriori, and should, therefore, be 
based on past experience. 

In this discussion a social programme has been considered to be composed of 
public and private activities, whether or not consciously oriented. When the 
programme has been established by a systematic procedure, such as the one 
broadly outlined in part 3 of this paper, it could be evaluated in the follow- 
ing ways: 

a. By measuring the ultimate results of the programme by the observed 

changes in the indicators of the levels of living, 

b. By testing the assumptions made in formulating the programme as re- 
gards the expected behaviour of the households, future population growth 
national income and expected results of social measures in the light of 
observed facts, 

c. By assessing the administrative efficiency of the agencies concerned. 


The measurement of levels of living by means of statistical indicators has 
been discussed in part 3.2 and the evaluation of assumptions, “laws” and 
theories or of the administrative efficiency of agencies cannot be approached 
in a general way. Such evaluations can only be made with reference to the 
particular assumptions, laws etc. related to individual programmes. In the 
final analysis, the only comprehensive yardsticks of the success or failure of 


social programmes are the statistical indicators of the levels of living. 








CONSUMERS’ PROPENSITIES TO HOLD LIQUID ASSETS* 


Haroip W. GuTsrie 


Cowles Foundation for Research in Economics 
at Yale University 


There is wide variation in the need for and the ability to accumulate 
liquid assets. The model used for investigating liquid asset holdings at- 
tempts to isolate the precautionary motive from other motives for 
holding liquid reserves. The propensities for holding liquid assets for 
precautionary purposes are analyzed by classifying consumers according 
to their long-run income position. Within each class regression equa- 
tions are used to test hypotheses concerning the relationship between 
liquid balances and age, size of consumer unit, level of current income, 
and home ownership. 

The results suggest that the propensity to hold liquid assets (1) is 
low for consumers who can expect long-run increases in income, (2) is 
higher for consumers who have permanently low incomes, but that 
these consumers may feel that their reserves are inadequate, (3) in- 
creases with age to a limit, and (4) decreases as the consumer unit 
becomes larger. 


HE effect of liquid asset holdings on consumers’ saving, once regarded as a 

persistent and simple negative relationship, has not been clearly demon- 
strated in empirical tests. The results of some tests using cross-section data 
have suggested that the asset effect is variable over time and that it may have 
both positive and negative signs. Studies by Klein [4, pp. 232-8] and Tobin! 
indicate that although liquid asset holdings had a relatively strong positive 
effect on durable goods expenditures in 1948, the effect was weaker in 1949 
and had disappeared by 1951-2. Katona [3, pp. 167-70] has suggested that 
there are two different asset effects. Some consumers with high asset holdings 
may feel relatively saturated with assets and therefore save less than those 
with low asset holdings. Other consumers with high asset holdings may have 
formed a habit of saving, or may be more thrifty, and therefore save more 
than those with low asset holdings. Tobin and Morgan [7, pp. 124-6] found 
that the sign of the asset effect depends on income change. According 
to Morgan, saving was positively associated with liquid asset holdings when 
the consumer’s income had increased substantially and negatively associated 
with liquid asset holdings when his income had declined substantially. 

The variety of hypotheses implicit in these findings suggests the need for an 
examination of liquid asset holding by consumers. Why do liquid asset holdings 
differ between consumer units? Why do they vary over time for given consumer 
units? Knowledge of the needs fulfilled by liquid asset balances seems to be a 
logical basis for hypothesizing the effect of the balances on saving. The following 
sections describe the results of an attempt to discover why some consumers 





* Financial support for the research reported here was provided by the Yale Workshop in Quantitative Eco- 
nomic Research under a grant from The Ford Foundation. The Survey Research Center, University of Michigan, 
and The Board of Governors, Federal Reserve System, made available the punch cards from Surveys of Consumer 
Finances. I am grateful to these institutions for their assistance. Special acknowledgment is due to James Tobin for 
helpful suggestions and to Harold W. Watts for assist in progr ing calculations. 

1 James Tobin has kindly permitted reference to results of research not yet published. 
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hold larger liquid asset balances than others and to explain the differences in 
levels of liquid assset holdings among specified kinds of consumers over time.” 


1, THE MEASUREMENT OF LIQUID ASSET HOLDINGS 


Consider two consumers, A and B, each holding $1,000 in liquid assets. Sup- 
pose that A has an annual income of $1,000 while B has an annual income of 
$10,000. Although each has saved the same absolute amount it is clear that the 
balances have been accumulated under different circumstances. The wide 
difference in incomes indicates that the two consumers probably have quite 
different standards of living. Furthermore it is reasonable to assume that A and 
B differ in their perceptions of need for liquid assets. 

The two consumers can be distinguished by expressing their liquid balances 
as a per cent of annual income. The ratio of liquid assets to income for B is 
10 per cent; the ratio of liquid assets to income for A 100 per cent. Thus the 
ratio of liquid asset holdings to annual income is a more meaningful measure 
than absolute balances for an analysis of liquid asset holdings. Hereafter it will 
be referred to as the LA ratio. 

The LA ratio has been used as an explanatory variable in several studies of 
saving functions. The LA ratio is chosen over the absolute level of liquid asset 
holdings in saving functions primarily on grounds of partial elimination of the 
positive relationship between asset holdings and income, and the reduction of 
variance in saving by dividing both sides of the saving function by income. 
Both of these reasons for dividing liquid asset holdings by income are relevant 
to the present analysis, but perhaps even more important is the need for a 
measure of asset holdings which is appropriate for an investigation of the con- 
sumer’s rationale for holding liquid assets. Assuming that the consumer’s desire 
to hold liquid assets is related to his general standard of living, which is approxi- 
mated by his income in a given year, the LA ratio measures liquid assets in an 
appropriate context. 

The data analyzed were gathered in Surveys of Consumer Finances by the 
Survey Research Center, University of Michigan, on behalf of the Board of 
Governors, Federal Reserve System. Liquid assets are defined as assets of fixed 
money value which can be converted to money quickly and conveniently. 
Liquid assets, as measured in the surveys, consist of holdings at the end of a 
given year in the form of savings deposits in banks and other savings institu- 
tions, checking accounts, and U. 8. bonds. Currency and the cash surrender 
value of life insurance policies, although liquid in the economic sense, are not 
included in the surveys for technical reasons. Total income includes cash in- 
come received by all members of the spending unit. 

Each interview is assigned a numerical weight to correct the data for vari- 
able sampling rates between income groups and for differences in response 
rates. These weights have been used in all calculations presented here. 


2. THE RATIONALE OF HOLDING LIQUID ASSETS 


The usual motivations for holding money can also be ascribed to the holding 
of liquid assets. Spending units hold liquid assets for transaction purposes, for 





2 The important question of aggregation, the relationship of the various explanations discovered through micro- 
analysis to the aggregates of liquid asset holdings, is not considered here. 
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speculative purposes, or for precautionary purposes, and all three motivations 
might apply for many families. This analysis of LA ratios among spending units 
cannot distinguish clearly between these motivations but it can lead to some 
general conclusions about the relative importance of the precautionary motive 
between different kinds of spending units. 

Except for differences in the propensity to consume between consumers, dif- 
ferences in the LA ratio should not show the effect of the transaction motive. 
Two consumers who receive different monthly salaries, each consuming his 
total salary from a checking account at a constant rate will each have an aver- 
age LA ratio of 1/24 over the month. One month can be considered the longest 
period for which wage payments are made in practice. Moreover, there may 
be economies of scale in transactions for the higher income families which 
would result in transaction ratios of less than 1/24. Therefore, 1/24 approxi- 
mates a maximum LA ratio for transaction purposes. The average LA ratios 
observed for many spending units in the Surveys of Consumer Finances are 
much larger than 1/24, even when only checking account balances are used in 
the numerator. The empirical analysis therefore assumes that observed dif- 
ferences in LA ratios cannot be attributed to the transactions motive. 

It is more difficult to eliminate the effect on the LA ratio of assets held for 
intended consumption or intended investment in securities or commodities. At 
a given point of time one consumer may hold $1,000 for unforeseen contin- 
gencies. Another may hold the same amount for an intended purchase of a 
house. It is not possible to distinguish between the two motives by observation 
of the consumers’ LA ratios. 

The influence of assets held for intended consumption or investment can be 
partially identified, however, by comparisons over time. The empirical results 
reported here compare LA ratios for specified types of spending units in 1947 
with the LA ratios for similar—but not identical—spending units in 1953. 
During the intervening period the supply of consumer goods increased rapidly, 
there was a boom in real estate and common stock, and interest rates on liquid 
assets were held at low levels through most of the period. It is reasonable to 
assume that assets held for intended consumption or investment had little in- 
fluence on LA ratios in 1953 compared to their influence in 1947. The forced 
saving during World War II resulted in a disequilibrium with respect to liquid 
asset holdings in 1947 relative to 1953. 

Still another type of motive for holding liquid assets must be considered, a 
type that is less rationalistic than the others, but one that is subjectively im- 
portant to the holders of the assets. There are many people who hold liquid 
assets because they are unaware of investments bearing a higher yield, because 
they are averse to any risk of money value, or simply because they do not think 
of non-liquid assets as being marketable. The last point seems to apply to a 
substantial number of respondents who, in a recent study conducted by the 
Yale Workshop in Quantitative Economic Research, were asked, “What is the 
best kind of asset to pass along (in an estate) to your children?” Many of 
the replies mentioned various forms of liquid assets. The reason for this choice 
was typically that money is easily divided and family quarrels are avoided. 
These motives must be acknowledged but we shall consider them to be sub- 
sumed under the precautionary motive, the desire to have money conveniently 
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at hand when emergencies arise. If emergencies do not arise the assets fulfill a 
residual function in yielding a small but sure return and as a store of money 
value for consumption during retirement or for bequests. 

The empirical analysis of the relative strength of the precautionary motive 

among certain groups of consumers is based upon the following postulates: 

1. During a given year differences in the LA ratio between groups of con- 
sumers cannot be attributed to the transactions motive. 

2. Because of forced saving during World War II the year 1947 is “ab- 
normal” with respect to liquid asset holdings. Differences in the LA ratio 
between groups of consumers arise from differences in the precautionary 
motive and from differences in intended expenditure. 

The year 1953 is “normal,” relative to 1947, with respect to liquid asset 
holdings, and differences in the desires for precautionary reserves account 
for large differences in the LA ratio. 

Differences in the LA ratio between 1947 and 1953 for similar consumer 
groups indicate whether or not the LA ratios of 1947 were “normal” 
and therefore held primarily for precautionary purposes—or “abnormal” 
—and therefore held for both precautionary purposes and intended 
expenditure. 


3. THE HYPOTHESES TESTED 


The hypotheses were developed after examining LA ratios for 1947 and 1951 
[2]; the tests have been performed on data for 1947 and 1953.3 The formulation 
of the hypotheses depended heavily on a consideration of the outcome of the 
conflict between the need and desire for consumption versus the need and de- 
sire for precautionary reserves. The outcome is expected to differ according to 
the number of persons in the spending unit and the position which the spending 
unit occupies in a profile of income over the life span of the head of the spend- 
ing unit. 

The preliminary examination of the LA ratios in 1947 and 1951 suggested 
that single-person spending units were so markedly different from multiple- 
person spending units that they should not be combined in a hypothesized 
continuum. Family formation apparently results in pressures to consume which 
lead to a lower propensity to hold liquid assets. Single-person spending units 
have therefore been separated from multiple-person spending units for empiri- 
cal analysis. 

The preliminary examination of LA ratios also suggested that the propensity 
to hold liquid assets varies with income in very complex relationships. To ac- 
count for the income effect in the present study three separate income classes 
are defined by the occupation, education, age, race, and regional location of 
spending units. The latter variables define groups of spending units and the 
mean income decile has been computed for each group. 





3 Data for the latter years are contained in the 1948 and 1954 Surveys of Consumer Finances, respectively. 
The data for 1947 were used both to develop and to test the hypotheses. Although this is not good statistical practice, 
data for 1946 were not fully comparable with data for subsequent years, and by 1948 consumers were well advanced 
in the process of adjusting to the postwar disequilibrium. Statistical purity was therefore sacrificed for a sharper 
economic effect. 
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For example, one group consists of white spending units living in the South, 
the heads of which were skilled workers with a high-school education and were 
between 35 and 44 years old. For each spending unit in the group there is an 
income decile position, from 1 (low) to 10 (high), which is determined by the 
relative position of his absolute income in an array of incomes for all spending 
units. The weighted mean of the individual income deciles within a group is the 
basis for combining groups of spending units into income classes. The income 
decile is used for classification rather than absolute income because relative 
income is assumed to be more stable over time. This comparative stability is 
also important in the regression model explained below. The groups were com- 
bined into classes by examining the mean income decile over the age span for 
each race-region-occupation-education group. If the mean decile is less than 
7.5 for all ages the entire race-region-education group is classed as “Low.” If the 
mean income decile is less than 7.5 for the younger age groups but 7.5 or more 
for at least one of the older age groups, the younger groups are classed as 
“Temporarily Low.” The groups which have mean income deciles of 7.5 or 
more, or which are older than such a group, are classed as “High.” The following 
diagram illustrates the classification procedure: 


Mean 
Income Low: AB 

Decile Temporarily low: CD 
High: DEFG 














Age 


The income classes are intended to reflect the spending units’ income ex- 
pectations. The “High” class includes spending units who can expect to main- 
tain a high relative income position. The “Low” class is composed of spending 
units whose status is such that they could not expect to rise above a decile 
position of 7.5. The “Temporarily Low” class consists of young people whose 
current relative income position is similar to the position of those in the “Low” 
class, but who can expect to rise to the upper levels of relative income as they 
grow older. A table showing the mean income decile position for all race-region- 
occupation-education-age groups is shown as an Appendix. 

It is possible that spending units who have different income expectations 
will not react in the same way to given changes in current income. In order to 
measure differences in the effect of current income on the LA ratio between in- 
come groups, the observed value of income decile is used as an independent 
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variable in regression analyses for each expected income group. The use of in- 
come decile rather than measures of absolute income allows direct comparison 
of income effects between 1947 and 1953. 

There are strong a priori grounds for expecting consumers to hold more liquid 
assets for precautionary purposes as they grow older. First, the need for such 
reserves increases with age because the threat of adverse economic events (e.g., 
illness, disability, loss of favorable employment opportunities) is greater for 
older people than for younger people. Second, the positive relationship between 
income and age gives older people the ability to accumulate liquid reserves. 
Finally, the preference for holding liquid assets vs. consumption may be ex- 
pected to become stronger as the pressures for consumption decline with age. 
On all three points--need, ability, and preference—precautionary reserves 
should increase with age. The results of the tests are consistent with this 
hypothesis. 

As the size of a spending unit grows larger, the need for precautionary 
balances should increase. An adequate reserve for medical expenses and other 
contingencies for families of several persons may be expected to be greater than 
the reserve needed for smaller families. The pressure for current consumption 
also increases with family size, however, and the preference which families ex- 
press between the satisfaction of desires for current consumption and the secu- 
rity of the desired level of liquid reserves is not obvious. Let us posit a preference 
for consumption as the stronger of the two demands. There is then a conflict 
between need and preference which is to be resolved by ability, the income 
position of the spending unit. As high income spending units increase in size, 
they are able to fulfill their consumption demands and also maintain the desired 
level of precautionary reserves. But as low income spending units grow larger, 
their needs for current consumption have priority over the demand for liquid 
reserves. It is expected that they will increase their liquid balances only as their 
incomes increase. The empirical tests support the posited preference for con- 
sumption over security as family size increases. The data do not support the 
hypothesis that the income position of the spending unit affects the relation- 
ship between liquid reserves and size of spending unit. 

Although this analysis concentrates on liquid assets as one of the more im- 
portant forms of wealth held by consumers, the size of holdings in other forms 
ought to affect liquid balances. The high frequency of life insurance ownership 
seems to indicate that life insurance is regarded as a favorable substitute for 
other forms of wealth. The available data permitted testing only the simple 
hypothesis that precautionary balances are negatively related to the spending 
unit’s equity in an owner-occupied home. Since it is reasonable to assume that 
home ownership satisfies a desire for security this hypothesis tests for one 
relevant kind of substitution effect. The results of the test indicate that there 
is not a general substitution effect between home ownership and liquid balances. 


4. THE DATA AND THE REGRESSION MODEL 


The original samples drawn for Surveys of Consumer Finances represent the 
entire consumer sector with minor exceptions, e.g., institutional and transient 
population. Several segments of the original samples have been omitted from 








PROPENSITIES TO HOLD LIQUID ASSETS 475 


this analysis. Farmers and owners of unincorporated businesses have been 
omitted because it is difficult, and in some cases impossible, to separate business 
assets from household assets. Although non-white spending units had LA 
ratios which were significantly lower than LA ratios among white spending 
units, all hypotheses attempting to explain differences in LA ratios among 
non-whites were rejected, and these spending units were excluded from further 
analysis. Similarly, spending units in which the head is retired have LA ratios 
significantly higher than the LA ratios of other spending units, but hypotheses 
formulated to explain differences in the LA ratios had to be rejected. Retired 
spending units were therefore omitted. A small number of spending units have 
LA ratios which, because their incomes are very low, are extremely high relative 
to others. To avoid the distorting influence of these extreme values, LA ratios 
of 1,000 per cent or more have been omitted. It was also necessary to omit 
those spending units for which values of one or more of the independent vari- 
ables were not ascertained. 

After these omissions the remaining observations were classified as follows 
according to the procedures described above: 

















Number of cases 
Class SS 
1947 1953 
Single-person spending units: 

High income 86 100 
Temporarily low income 78 79 
Low income 313 266 

Multiple-person spending units: 
High income 520 584 
Temporarily low income 188 231 
Low income 941 678 
1,938 


Total | 2,126 





For each of the six classes and for 1947 and 1953 separately a regression equa- 
tion of the following form has been calculated: 


LA 
100 YY = bo oe b,A a boN ote b3;D a b,D? oa b;(OA) a Uu 


where: 


100 LA/Y =the LA ratio described above, expressed in ver cent 
A =age of head of spending unit in the scale: 
1=age 18-24 
2=age 25-34 
3=age 35-44 
4=age 45-54 
5=age 55-64 
6=age 65 and over 
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N=number of persons in the spending unit except that N =0 for 
single-person spending units 

D=value of income decile for the individual spending unit regard- 
less of its income class 

D? = squared value of D 

O =home ownership =1 if owns home 

0 if does not own home 
u=residual 


The curvilinear form for the influence of D on the LA ratio fits the U-curve of 
LA/Y plotted against D for all spending units. The OA variable is a proxy for 
equity in an owner-occupied home assuming that equity is positively related 
to the age of the owner. 


5. ANALYSIS OF THE LEVEL OF THE LA RATIO IN 1953 


Table 1 shows the mean value of the LA ratio and the regression coefficients 
in 1953 for each group. Table 2 shows an analysis of variance of the level of the 
LA ratio within classes with the regressions not taken into account. Table 3 
shows the F ratios which test for significant net differences in regression co- 
efficients between classes, accounting for the joint effect of all regression vari- 
ables and coefficients. Table 3 tests the hypothesis that the variance explained 
by separating classes is equal to the remaining variance of u after separation 
of classes. Both Tables 2 and 3 test for differences between single- and multiple- 
person spending units, and between the three income classes. 


TABLE 1. VALUES OF REGRESSION COEFFICIENTS AND MEANS OF 
THE LA RATIO IN 1953 FOR SPECIFIED CLASSES 
(standard errors shown in parentheses) 





Regression Coefficients 
Mean —<—_——————__—_—_—_——— 
sA Ratio P Decile Home 
J . Jecile 
Age Number Decile cqueed Oveauhlp 





Single-person Spending Units: 
High income 56 . 52 5: 22 .63* - 1.35 ~—11.38* 
47) (18.85) (1.42) (4.91) 


Temporarily low income 25. 53. 4.10 - —21.83* 
.94) (10.69) 


Low income 2.6 54.73 3.66* - .31* 
.05) 


Multiple-person Spending Units: 
High income 33. 37 . 22 .41* —6.76* 9.90 ; .57 
2.65) (1.75) . 56) . 53) .34) 


Temporarily low income 9.3: , .08 —3.06* -15 -26 — .25 
.38) 2.15) ( .86) 2.57) .20) (1.05) 


Low income 28.56 11.38* —6 .09* — 7.05 -48 2.64 
(18.68) (2.44) (1.85) ( 5.60) ( .47) (1.54) 











1 Not calculated because there were no home owners in the class in 1947. Consistent procedures were used in 
both years to allow comparisons between years. 
* Regression coefficient significantly different from zero at a level less than 5 per cent. 
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TABLE 2. ANALYSIS OF VARIANCE FOR DIFFERENCES BETWEEN 
LEVELS OF THE LA RATIO IN 1953 








Degrees of 


Sum of § > 
_ a Freedom 


Estimated Variance 
Source of | 


Variation 
t Ex- 


Unex- Used Re- Ex- Unex- 


plained plained maining} plained plained 








Spending Unit | 277,566 11,823,179 1,936 | 277,566 6,107 
Income 178,425 11,922,319 1,935 | 89,212 6,161 

















** Significant difference at the 1 per cent level. 


The mean LA ratio is higher among single-person spending units than among 
multiple-person spending units. Moreover, the regression coefficients indicate 
that a reduction of the LA ratio is associated with each additional person in the 
spending unit. A tentative conclusion based on 1953 data is that as family size 
increases, consumption (and other forms of saving) gains in preference over 
holding liquid assets. 

The significant difference between income classes shown in Table 2 is ac- 
counted for, in part, by the low LA ratio for the temporarily low income group. 
By definition this group contains relatively young spending units, however. 
The regression coefficients indicate that the LA ratio is positively related to 
age in both high and low income groups. The low LA ratio among spending 
units with temporarily low incomes is probably explained in part by their age. 

According to Table 3 there are no significant net differences in the regression 
coefficients between classes when all coefficients are tested jointly. Inspection 
of the individual coefficients in Table 1 suggests, however, that the effect of the 
income decile position is different between classes. The income coefficients in 
the multiple-person spending unit classes are not significantly different from 
zero. Among multiple-person spending units, regardless of income class, the LA 
ratio is not affected by income differences. Similarly in the high income, single- 
person spending unit class, the LA ratio is not affected by income decile posi- 
tion. In both low income, single-person spending unit classes there is an income 
effect, however. Those who have temporarily low incomes show declining LA 
ratios as their income position is improved. Spending units who, by their 


TABLE 3. F TESTS FOR DIFFERENCES IN REGRESSION 
COEFFICIENTS BETWEEN GROUPS, 1953 








Sums of squares of u Degrees of Freedom 





Groups lained Used 

Tested Unexplaine Ex- Re- 

plained Pooled Sep- Net | M#ining 
arate 








Pooled Separate 





Spending Units | 10,711,883 10,673,744 | 38,139 6 11 5 1,926 
Income 10,711,883 10,668,707 | 43,176 6 18 12 1,919 
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occupation-education characteristics, have been classed in a permanently low 
income group show changing preferences for liquid assets as their income in- 
creases. In the lower range of income deciles the LA ratio is reduced as income 
increases, but with further increases in income the LA ratio rises. 

In only one class, the high income, single-person spending units, is there any 
evidence that equity in a home substitutes for liquid asset holdings. The vari- 
able for measuring equity is crude, and it yields no information about the mag- 
nitude of the substitution effect. Nevertheless, it is interesting to note the oc- 
currence of the substitution effect only in the class in which the need for liquid 
asset holdings is relatively low and the ability to save is relatively high. The 
data suggest that, except for the very minor group noted above, consumers do 
not shift between asset forms while maintaining some normative level of secu- 
rity in total wealth. Rather it is possible that each form of asset has a desired 
normal level and there is no “portfolio problem.” 

The analysis of the level of the LA ratio in 1953 suggests the following tenta- 
tive conclusions. Once a household of more than one person is established the 
LA ratio is determined by the age of the head and the size of his family. Further- 
more, there are no economically significant differences in the effect of these 
variables between the high and low income classes. The propensity to hold 
liquid assets among single-person spending units varies with their income class, 
however. Those in the high income class have a relatively high average LA 
ratio but individual LA ratios are not associated with differences in the income 
decile. There is some evidence of satisfaction with the level of the LA ratio in 
this group in the tendency to let equity in a home substitute for higher LA 
ratios. Those who can expect high income, the temporarily low income class, 
have a low propensity to hold liquid assets. But additional income is evidently 
diverted primarily to consumption and, possibly, other forms of saving, indi- 
cating satisfaction with the low LA ratio. The single-person spending units 
who have low incomes and cannot expect to rise to high income positions show 
a latent unsatisfied demand for higher liquid balances. Their average LA ratio 
is comparable to the average LA ratio in the high income class but it tends to 
increase in the higher range of income deciles.‘ 


6. ANALYSIS OF DIFFERENCES IN LA RATIOS BETWEEN 1947 AND 1953 


A dynamic analysis of the change in the LA ratio over time for identical 
spending units—if it could be performed—might reveal clearly the motivations 
for holding liquid balances. The data available do not describe identical spend- 
ing units in two years. If we regard the motivations for holding liquid assets as 
a set of preferences between security and current consumption—the preferences 
varying with income expectations and family strueture—we may expect to find 
differences in behavior over time between similar spending units. We may ex- 
pect that different kinds of spending units reacted to the relatively large liquid 
balances accumulated during World War II according to differing preferences. 
It is possible that the forced saving of the early 1940’s produced abnormally 
large liquid balances for some spending units. To other spending units the war- 





4 An alternative interpretation is that only those few persons who attain higher income levels are concerned. 
As a result of a desire for higher liquid balances they may work harder than others, work overtime, or hold more than 
one job in an effort to raise their LA ratio. 
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TABLE 4. MEAN VALUES OF THE LA RATIO FOR SPECIFIED 
CLASSES OF SPENDING UNITS, 1947 AND 1953 


Class 








1947 1953 % Change 








Spending Units 
N=1 
N>1 


Income Groups* 
High 
Temporarily Low 
Permanently Low 


Spending Unit and Income 
(High 

N= 1) Temporarily Low 

{Permanently Low 


56. 
25. 


[High f 33.2 
N>1; Temporarily Low ; 9.3 

| Permanently Low 35. 30.0 
All Spending Units* 


41.2 34.0 














* Differences in mean LA ratios between years are statistically significant; see Table 5. 


time accumulations may not have seemed abnormal even though they had 
never held such large liquid balances previously. Given that the consumer 
sector held unprecedented liquid balances at the end of World War II, for 
which kinds of consumers were these balances abnormal in terms of their 
motivations to hold liquid assets? One kind of answer is found by comparing 
the levels of the LA ratios between 1947 and 1953 for different classes of spend- 
ing units. 

Table 4 shows the mean LA ratios in the two years for specified classes of 
spending units. Table 5 shows the results of an analysis of variance of the LA 


TABLE 5. ANALYSIS OF VARIANCE FOR DIFFERENCES BETWEEN 
LEVELS OF THE LA RATIO, 1947 AND 1953 








Source of 
Variation 


Sums of Squares 


Ex- 
plained 


Degrees of 
Freedom 


Estimated 
Variance 





Unex- 
plained 


Re- 


Used on 
maining 


Ex- 
plained 


Unex- 
plained 





Years (7) 

Spending Unit 
(SU) 

Income (Y) 

TXSU 

eo 8 i 





51,900 
366 ,015 





27 ,024 ,006 
26 , 709 ,892 


26 ,803 , 134 
26 ,642 ,728 
26,700,410 


* Significant difference at the 5 per cent level. 
** Significant difference at the 1 per cent level. 





4,062 
4,062 


4,061 
4,060 
4,058 


51,900 
366 ,015 


136 ,386 
15,263 
25,413 





6,653 
6,576 














480 AMERICAN STATISTICAL ASSOCIATION JOURNAL, SEPTEMBER 1960 


ratio for different groups of spending units when time is introduced as an inde- 
pendent variable. Significant differences in mean LA ratios between 1947 and 
1953 occur for all spending units {7'), and for spending units classified by in- 
come (TX Y).5 

Thus there is a significant decline in the LA ratio for the entire consumer 
sector, but the high income class and the temporarily low income class account 
for the decline. The trivial change in the LA ratio among permanently low in- 
come spending units suggests that their 1947 liquid balances were not ab- 
normal. The needs and preferences for liquid assets were evidently stronger 
among these people than among those who had, or could expect, high incomes. 
If these results evolved from an analysis of identical spending units we would 
conclude that permanently low income people maintained their post-war liquid 
reserve position more tenaciously than others. Given similar—but not identical 

spending units in the two years, the differences indicate only that the moti- 

rations for holding liquid assets at the 1947 level were stronger in 1947 among 
low income groups than among high income groups. 

Table 6 shows the results of F tests for significant net differences in the re- 
gression coefficients between years, accounting for the joint effect of all regres- 
sion variables and coefficients. Table 6 tests the hypothesis that the variance 
explained by separate regressions for the two years is equal to the remaining 
variance of u after separation of the years. The significant net differences be- 


TABLE 6. F TESTS FOR DIFFERENCES IN REGRESSION 
COEFFICIENTS BETWEEN YEARS 





Sums of Squares : 
q Degrees of Freedom Used 








Unexplained Explained 
by . Years 

Years Years Separate Years Sepa- : Re- 
Pooled Separate Years rate 


Groups Tested 





ive ss 
Pooled maining 





All Units 24,000,614 23,876,801 | 123,813 2 6 4,051 


N=1 
N>1 


Spending Units 
| 9 ,897 ,458 ‘ 415 | 120,044 

13,884,898 13,823,72 61,172 

Income | 
High 4,664,121 4,524,978 | 139,143 
Temporarily low | 589 ,995 563 ,787 26,208 
Permanently low | 18,635,921 18,601,222 | 34,699 


Spending Unit and Income Groups | 
(High 

N =]; | Temp. low 

| Perm. low 


1,534,425 1,367,453 | 166,972 
266 ,820 255 ,316 11,504 
7,978,182 7,925,230 52 ,952 


(High 
N >1li Temp. low 
| Perm. low 


3,032,355 3,006,661 | 25,694 
318,785 297,401 | 21,384 
10,477,974 10,440,015 | 37,959 











| 
| 
| 








* Significant difference at the 5% level. 
aad Significant ¢ diffe erence at the 1% level. 





5 ‘ The test of significaace of the interaction effect (7 XY) requires equal or etenatinnh ce sell frequencies. The 
frequencies for the two years are tabulated in Section 4. The numbers of cases in each income group in the two years 
are assumed to be sufficiently similar, and large, to justify the test. 
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tween the regression coefficients in the two years do not necessarily imply sig- 
nificant differences between years for particular coefficients. Differences be- 
tween years for particular coefficients can be submitted to tests, however, in a 
search for varying effects of the independent variables. Table 7 shows the re- 


TABLE 7. REGRESSION COEFFICIENTS AND STANDARD ERRORS FOR 
SPECIFIED CLASSES, 1947 AND 1953 








Regression Variables 


Decile Home 


Intercept Age Number Decile cquesed 4Ownwahlp 


33 .12* 12.96* —6.09* —10.93* .92* ay f 
( 8.99) ( 1.37) ( .95) ( 2.97) ( .25) .96) 





34. 86* 18.11* —7.12* —14.36* .37* 81 
All Spending Units A" ( 9.66) ( 2.18) (1.03) ( 3.13) 2 .75) 


— 1.74 — 5.15* 1.03 3.43 A 2.58 
(13.19) ( 2.58) (1.40) ( 4.31) . .99) 


42.45* —20.10* 
(20.87) 2. ( 8.58) 


5.05 J - —14.23* 1.82 
Single-Person Spending (20.34) 3. ( 8.19) d 3.13) 
Units 
37.40 9.1 _— — 5.87 .24 
(29.14) 5. (11.87) 4.12) 
31.53* a —6.12* — 8.69* 1.70 
(11.93) J (1.11) ( 3.45) .95) 





Multiple-Person Spending 7 76 .02* 3° —7.45* —21.58* 44 
Units i | (13.05) 2.8 (1.30) ( 3.59) 2.40) 





b, 58-by!? 44.49* 0: .33 12.89* 1.26 
| o(o,5-n,!7) | (17.68) 3.2% ( ( 4.98) ! 2.58) 
39.13 2. —11.69* 1.34 
(21.70) 2. (1.44) ( 5.97) 1.36) 


High Income Spending 7 97 .09* —9 .96* —30.65* 2. — 6.60* 
Units p47 |} (26.52) ( 4,24) (1.78) ( 7.00) f 2.98) 


57 .96 —10.43* 3.77 18 .96* ; 5.26 
(34.26) ( 4.98) (2.15) ( 9.19) ‘ ( 3.28) 
37 .13* .89 -3.66* OF AS 46 
( 7.47) 2.90) ( Py 2% 1.60) 





Temporarily Low Income i 6.24 3.33 - 2 4.15 


9 Ke 
Spending Units ? (14.97) § (2.14) ( 5.23) .44 3.99) 


30.89 — 2.4 —1.14 —11.38 71 3.69 
(16.73) : (2.33) ( 5.91) ( .49) ( 4.30) 





29.18* 3.28 —6.24* 9.94* .78 1.92 
(14.41) 2.2% (1.58) 4.93) ( .44) ( 1.51) 


Permanently Low Income 7 32.25* 17 .08* —6.52* 13 .98* 1.25* — 46 
Spending Units | 06," (13.19) ( 2.94) (1.41) ( 4.57) ( .42) ( 2.37) 


* Significantly different from zero at a level less than 5 per cent. 
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TABLE 7—(cont.) 








Regression Variables 



































™ . Decile Home 
e Age I 2 Je a 
Intercept Age Number Decile squared Ownership 
| y,58-b,«? | — 3.07 — 3.83 .28 4.06 = 2.38 
| ot.) | (19.54) (3.68) (2.12) (6.72) (61) (2.81) 
| | 
b;5 19.53 22 .63* -- —15.94 1.35 —11.38* 
0», (51.78) ( 8.47) (18.85) (1.42) ( 4.91) 
High Income Single-Per- b;4? 154.18 30.39* — —73.45* 6.82* — 6.96 
son Spending Units | 0,47 (86.76) (12.78) (23 .63) (2.01) ( 6.78) 
| bib? =| 134.65 — 7.76 i 58.41* -—5.47* — 4.43 
| o(o,5-2,47) | (101.04) (15.33) (28.45) 2.46) ( 8.37) 
| b;53 63.91* 4.10 _ —21.83* 2.19 _ 
| o0,5 | (22.33) ( 9.94) (10.69) (1.36) 
| 
Temp. Low Income Single- | },*7 2.13 — .94 _ 12.48 —1.10 —_— 
Person Spending Units | 0»,! (24.95) ( 9.82) (10.97) (1.18) 
b;53—b;47 | 61.79 5.04 — —34.31* 3.29 —- 
| o(o,5»,47) | (33.48) (13.97) (15.32) (1.80) 
| bs 54.73 13 .66* — —31.31* 3.83* 1.57 
| 0,8 (32.52) ( 4.65) (14.05) (1.67) ( 3.58) 
Perm. Low Income Single- | bys? —10.26 23 .23* —_ — 7.12 1.33 — .86 
Person Spending Units 0o,"" (26 .67) ( 4.83) (11.66) (1.32) ( 3.93) 
| b,88-by4? 64.99 — 9.57 - ~24.19 2.50 2.43 
| o,-»,4) | (42.06) (6.70) (18.26) (2.13) (5.32) 
| bys 37.23 10.41* —6.76* — 9.90 .93 .57 
| oo, 5 (28.71) ( 2.65) (1.75) ( 7.56) ( .53) ( 1.34) 
High Income Multiple-Per- | byt? 71.78* 14.58* —8.11* —18.15* 1.57* — 3.02 
son Spending Units 00," (29.54) ( 4.68) (2.07) ( 7.84) ( .55) ( 3.75) 
b,53—-b,47 —34.55 — 4.17 1.35 8.25 — .64 3.59 
(4 ;5-b,4") (41.19) ( 5.38) (2.71) (10.89) ( .76) ( 3.98) 
b,* | 4.99 08 —3.06* 4.15* -— .2% — .25 
oD, | ( 8.38) ( 2.15) ( .86) ( 2.57) ( .20) ( 1.05) 
| 
Temp. Low Income Multi- | },*’ —12.52 8.97 —2.90 8.76 — .41 — 7.01 
ple-Person Spending Units | g»,‘’ (29.95) ( 7.19) (2.62) ( 8.84) ( .66) ( 4.66) 
by58—b,47 17.51 — 8.88 — .16 — 4.61 14 6.76 
7 (b,5*-»,*”) (31.10) ( 7.51) (2.76) ( 9.21) ( .69) ( 4.77) 
5,53 28.56 11.38* —6.09* — 7.05 .48 2.64 
oo, (18.68) ( 2.44) (1.85) ( 5.60) ( .47) ( 1.54) 
Perm, Low Income Multi- | 5,47 80 .44* 10.00* —7.56* —23.83* 1.95* 2.27 
ple-Person Spending Units | 0%,*’ (18.15) ( 4.09) (1.85) ( 5.24) ( .46) ( 3.41) 
by5*-b,47 —51.88 1.38 1.48 16 .78* —1.48* .37 
7 





| 


o(o;5*-5,47) | (26.04) ( 4.77) (2.62) ( 7.67) ( .66) ( 3.74) 





sults of these tests for specified groups including all groups for which significant 
F ratios were noted in Table 6. 
The decline in the mean LA ratio between 1947 and 1953 for all spending 





PROPENSITIES TO HOLD LIQUID ASSETS 483 


units is partially explained by the significant reduction in the regression co- 
efficient for age. The 1947 LA ratio may therefore be regarded as abnormal for 
older spending units. Since the high income group consists largely of older 
spending units, the age effect is consistent with the differences in mean LA 
ratios between income groups. When single-person spending units are separated 
from multiple-person spending units, the single-person spending units seem to 
account for the shift in the LA function of the age variable. The observed 
change in the regression coefficient for age among multiple-person spending 
units is negligible; the change for single-person spending units fails by only a 
small margin to meet the 5 per cent criterion for significant differences between 
years. 

There is no evidence of a shift in the LA ratio function on the income vari- 
able for single-person spending units as a group. As shown in Chart 1, the LA 
ratio functions for 1947 and 1953 intersect, indicating that the 1947 LA ratios 
may have been abnormal for those with higher incomes but not abnormal for 
those with lower incomes. The significant shift from the abnormal 1947 levels 
of the LA ratio for the multiple-person spending units is also shown in Chart 1. 

When all spending units are separated into the three income classes, different 
patterns of variation in the LA ratio between 1947 and 1953 emerge. In the 
high income group there is a significantly lower slope for age in 1953 than in 
1947, again indicating abnormally high LA ratios in 1947 for the older spending 
units. There is no difference in the age effect between years for either of the two 
low income groups. 

The poor regression fit for 1947 in the temporarily low income group does 
not allow meaningful interpretation of the significant inter-year difference 
noted in Table 6 for this group. The inter-year differences in regression co- 
efficients for the permanently low income groups are remarkably small. There 


CHART 1. EFFECT OF INCOME ON THE LA RATIO FOR SINGLE- AND 
MULTIPLE-PERSON SPENDING UNITS, 1947 AND 1953 
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CHART 2. EFFECT OF INCOME ON THE LA RATIO FOR HIGH INCOME 
AND PERMANENTLY LOW INCOME CLASSES, 1947 AND 1953 

LA’/Y LA’/Y 
&0 4 


le 
“<4 


| 


60 a 


T 
so + 








8 9 lo 
Income Decile 


LA’/ Y = bo + b;D + by D? 


High Income Class Permanently Low Income Class 


is no evidence of any shift in the LA function for the permanently low income 
spending units, and the 1947 LA ratios for this group must be regarded as 
normal in the context of their needs and preferences for liquid reserves. 

The effect of the level of income for individual spending units in the income 
classes is shown in Chart 2 which contrasts the high income class with the 
permanently low income class. The functions have been plotted for the entire 
range of income decile because it is possible for a spending unit to have an in- 
come which deviates considerably from the mean for his class. It should be 
noted, however, that the observations tend to cluster above the 7.5 income 
decile for the high income class and below the 7.5 income decile for the per- 
manently low income class. According to the t-tests in Table 7 the inter-year 
differences in the income effect are statistically significant in the high income 
group. The right branches of the parabolas for 1947 and 1953 indicate a lower 
propensity to hold liquid assets in 1953 than in 1947 for this group. 

Of the six income-spending unit classes submitted to F-tests for net increases 
in explained variances by separating years, only two tests showed significant 
differences. In the class consisting of temporarily low income, multiple-person 
spending units, only one of twelve regression coefficients differs significantly 
from zero. The large unexplained variance for this class precludes further 
analysis. In the other of the two classes, single-person spending units in the 
high income class, there is a significant change from a parabolic income effect in 
1947 to no income effect in 1953. Noting again that the most relevant range of 
income deciles for this group is above the 7.5 level, it appears that World War IT 
conditions resulted in abnormally high liquid asset accumulations among 
single-person spending units in high income groups. The 1953 data for similar 
people show that equity in a home is regarded as a favorable substitute for 
liquid balances and there is no tendency for the LA ratio to increase with 
income. 
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7. CONCLUSIONS 


Perhaps the most important single conclusion to be drawn is that there is no 
such thing as “the” propensity to hold liquid assets. The evidence indicates 
that the motivations for holding liquid balances vary widely and that the moti- 
vations can be delineated—if not defined—by a demographic and economic 
classification of consumers. 

The LA ratio is positively related to age in each of the years 1947 and 1953. 
The rise in the LA ratio with age has limits, however, as indicated by the ab- 
normal levels attained in 1947 by the older singie-person spending units, and by 
the older high income spending units. Thus the evidence confirms the hypothe- 
sis that precautionary reserves increase when spending units need more reserves, 
are able to cumulate more reserves, and have a lower relative preference for 
consumption. 

A preference for consumption and other non-liquid forms of saving over 
liquid balances was posited for larger families, and the evidence supports this 
assumption. The LA ratio is generally higher among single-person spending 
units than among larger spending units. Furthermore, the regression coefficient 
of the size variable is consistently negative. The preference for satisfaction de- 
rived from sources other than liquid balances is also suggested by the abnor- 
mally high intercept in 1957 for multiple-person spending units. 

The effect of the level of income on the propensity to hold liquid assets varies 
between the groups examined. The LA ratios of 1947 were abnormally high for 
spending units who had high incomes. The high income people had evidently 
passed a point of saturation with liquid reserves, but this conclusion applies 
particularly to the single-person spending units. In 1953 there was no evidence 
of a relationship between the LA ratio and the level of income among the high 
income, single-person spending units. Moreover, this was the only group show- 
ing a tendency to let equity in a home substitute for liquid reserves. 

Like the high income spending units, those in the temporarily low income 
class had abnormally high LA ratios in 1947. The decline in the LA ratio from 
1947 was greater, in relative terms, for the temporarily low income, multiple- 
person spending units than for any other group. This sharp decline, along with 
the very low level of the LA ratio in 1953, suggests a low propensity to hold 
liquid assets. As implied by the Modigliani-Brumberg [6] and Friedman [1] 
theories of saving,® the expectation of higher incomes may reduce the need for 
current saving. 

Those in the low income class may have been able to fulfill a latent need for 
more security in the form of liquid balances during the prosperous periods of 
World War II and the following years. Their LA ratios in 1947 could not be 
considered abnormal relative to comparable LA ratios in 1953.’ This finding is 
reinforced by the data for 1953 for low income, single-person spending units. 
As income increases among these spending units the LA ratio declines but then 
rises in the higher range of income decile. This behavior suggests a general need 





6 See also Harold W. Watts, “Long-run income expectations and consumer saving,” Studies in Household Eco- 
nomic Behavior, Yale Studies in Economics, Vol. 9, Yale University Press, 1958. 

7 It is possible that the low income class did not accumulate unusually high liquid balances during World War 
II. In the absence of data to support this belief I choose to interpret the findings as if the permanently low income 
class had been forced to save. 
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for the security provided by liquid balances among low income spending units, 
a need which is partially fulfilled only when the income earner has no de- 
pendents and when his income rises beyond the fourth income decile. 

The results of this paper differ, in part, from those found in two other recent 
investigations of liquid asset holdings. Tobin and Watts [8], in the course of 
an extensive analysis of many stocks and flows measured by the 1950 Budget 
Study, Bureau of Labor Statistics, calculated regressions for “cash in bank” 
and changes therein. They found that variations in both stocks and flows of this 
component of liquid assets were associated with differences in age, size of family, 
disposable income, and housing level as well as some other variables not in- 
cluded in this analysis. After removing demographic effects they found that 
cash balances and flows are related to stock of durable goods, mortgage debt, 
and installment debt. 

H. F. Lydall [5] has emphasized net worth as a determinant of liquid asset 
holdings. He concludes from an analysis of British spending units that net 
worth is much stronger than income in determining the level of liquid asset 
holdings. The effects of demographic variables were not considered explicitly. 

The propensity to hold liquid assets is by no means simply determined. At 
least three major kinds of variables are involved: demographic characteristics 
of the household, the stock of non-liquid assets, and the current and expected 
income position of the household. The three types of variables are themselves 
highly inter-correlated and there may be complex interaction effects among 
them. We still lack a theory which will explain satisfactorily the level of liquid 
asset holdings and the relationship between these reserves and saving behavior. 


APPENDIX 


Income deciles are observed for individual spending units in each Survey of 
Consumer Finances. The income deciles are established by arraying absolute 
incomes and dividing the array into tenths. The income deciles range from 1 
for the lowest 10 per cent of incomes to 10 for the highest 10 per cent of in- 
comes. Tables 1A through 1C show the weighted means of income deciles for 
groups of spending units classified by race, region, occupation, education, 
and age. 

Data for four years, viz. 1947, 1948, 1952, and 1953, have been combined in 
constructing these tables. Tests performed by Robert Summers indicate that 
differences between group means for individual years are not statistically 
significant. 
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TABLE 1A. MEAN VALUES OF INCOME DECILE POSITION FOR SELECTED 
SPENDING UNITS 1947-48 AND 1952-53! 








White, South, Non-Farm 





Education Professional, 
Managerial, Skilled Unskilled Other Retired 
Business, Clerical 





Grade-School or less: Age 
18-24 .5-.3 
.6/74 


25-34 3-2 
8/95 


.1-.2 
1.0/109 


.-. 


2 
afen 





2.3-.2 


1.1/30 











High-School: Age 
18-24 


1.2/162 


6.9-.2 
.9/137 


6.3-.2 
1.1/148 








5 2-5 


5 
3/42 
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TABLE 1A—(cont.) 





White, South, Non-Farm 





Education Professional 
’ 


Managerial, Skilled Unskilled Other Retired 
Business, Clerical 





College: Age 
18-24 3.8-.3 
| .3/38 





3.7-.5 
.2/22 











| 
} 
65+ 4/66 





1 Entries in Table 1 consist of A—B, C/D, where A =mean value of income decile; B =standard error of mean 
income decile; C =proportion of sample represented in the cell; D =number of cases. 


TABLE 1B. MEAN VALUES OF INCOME DECILE POSITION FOR SELECTED 
SPENDING UNITS, 1947-48 AND 1952-53} 


v2 





White, Non-South, Non-Farm 





Education 


Profes- Business Managers, Clerical Pe Un- ‘ 
. SKE I -) 
sional Owners Officials and Sales killed skilled Other Retired 


Grade-School or 
less: Age 
18-24 








.O-.2 
1.0/124 


2. 9 
oo”. 








PROPENSITIES TO HOLD LIQUID ASSETS 


TABLE 1B—(cont.) 


White, Non-South, Non-Farm 





Education EA ‘ 
Profes- Business Managers, Clerical us Un- - 
, Skille 
sional Owners Officials and Sales “killed skilled Cier =—- Basted 
High-School: 
Age 
18-24 3.3-. .3-.§$ 3.9-. 3.6—.1 .O-.2 3.8-.3 

1.4/202 1.2/146 .5/ -3/32 


6.3-.1 .&.1 oo. .1-.3 
1.8/239 3.7/451 


6.7-.1 .3-.1 
1.4/209 3.0/382 


45-54 a. oo, . 6.9-.2 4.1 
1.3/183 1.5/200 





55-64) .8~.1 .-. 6.3-.2 


2.2-.1 
1.6/197 





College: Age 
18-24 





.2/39 


* Less than one-half of one per cent. 

** No observations. 

| Entries in Table 1 consist of A-B, C/D, where A =mean value of income decile; B =standard error of mean 
income decile; C =proportion of sample represented in the cell; D =number of cases. 
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TABLE 1C. MEAN VALUES OF INCOME DECILE POSITION FOR SELECTED 
SPENDING UNITS 1947-8 AND 1952-3! 


Not White 


Farmers 
South | Not South 








Grade High Grade High Grade High 
School School School School School School 
or less or more or less or more or less or more 


2.2-.2 2.9. 4.1-.9 3.5-.3 
.3/26 ‘ .4/40 


3.3-.2 
.5/48 
1.3/205 


3.8—.2 
1.4/204 


.o-.2 a>. 
1.3/213 .3/49 


65+ OO. 9.3 | 2.9-.2 3.9-.6 
.9/149 .2/29 














1 Entries in Table 1 consist of A —B,C/D, where A =mean value of income decile; B =standard error of mean 
income decile; C =proportion of sample represented in the cell; D =number of cases. 





EARLY FAILURES IN LIFE TESTING 


Rupert G. MI.uEr, JR.* 
Sandia Corporation 


Data from certain life test experiments exhibit such an unusually high 
concentration of failures near time 0 that the assumption of an over-all 
exponential density is unwarranted. One hypothesis for this phenome- 
non is that due to faulty construction or defective parts, certain test 
items fail prematurely; these items are termed “early failures.” To 
handle this type of data, an early failure model is postulated in which 
one failure rate is assumed to be in effect for an initial time interval 
[0, 7c) and another, lower failure rate is operative thereafter. Es- 
timators for the two failure rates are given in the case where 7'y is 
known and in the case where 7’) is not known exactly but can be as- 
sumed to be within a specified interval. Methods for obtaining ap- 
proximate large sample confidence regions are outlined and procedures 
for handling small samples are described. 


1. INTRODUCTION 


N THEIR fundamental paper, B. Epstein and M. Sobel [4] postulated a model 
I for life test experiments and exhibited means by which such experiments 
could be analyzed statistically. To summarize briefly, a life test experiment 
consists in testing N components or items (whether they be radio tubes, ca- 
pacitors, etc.) until n of them have failed, and the basic assumption is that the 
probability density for length of life is exponential with parameter 1/8, i.e., 


1 
pri(t; @)dt = rj e~ */¢dt, t= 0. (1) 


The maximum likelihood estimator of @ is 


6b, = fv ton = ites + tan + (N = ew (2) 


n 


where tiv S --- St,y are the ordered times of failure. The desirable properties 
of 6,,. are numerous. In particular, 6,7 is exactly distributed as (6/2n)x3,, and 
it is a sufficient, efficient minimum variance unbiased estimator of 6. These re- 
sults and the corresponding hypothesis testing techniques based on 6, are 
discussed in reference [4]. 

This model has been extensively applied, and the consensus is that the ex- 
ponential model is a reasonable assumption in many instances. However, fre- 
quently the experimental data indicate that the exponential model fails to hold 
exactly because the experimenter is confronted with “early failures.” The data 
suggest the hypothesis that the items under test are composed of two groups. 
There are those which have been constructed according to specification and 
will eventually fail randomly; the life span probability density for these items 
is exponential. On the other hand, there are also those components which, due 
to inferior quality or faulty construction, will fail shortly after being put on 
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test. These “early failures” preclude the assumption of an exponential density 
and the use of existing techniques to estimate the mean life of “standard” com- 
ponents. 

This paper represents an attempt to formulate a probability model in which 
it is possible statistically to handle early failures. However, it by no means 
constitutes a solution to all early failure problems. As with all probability 
models, whether or not it can be applied to a particular experimental problem 
should be decided by scrutiny of the assumptions underlying the experimental 
process and the agreement between the theoretical and sample distributions. 


2. EARLY FAILURE MODEL 


The exponential density (1) assumed by Epstein and Sobel arises from con- 
siderations on the rate of failure of test items. For a component under test the 
failure rate at time ¢, \(¢), is defined by 


— PIT >t} -—P{T>t+ at} 
lim ———____—— am 
Ato At-P\T > t} 


——" =, (3) 


where the random variable T is the time of failure. The limit expression (3) 
indicates that the conditional probability of the item failing in an infinitesimal 
interval of length At after ¢, given it has survived to time ¢t, is approximately 
d(t) At, or, equivalently, 


P{T >t+ At} = (1 — AWA)P{T > t} + o(Ad. (4) 


\(t) is the parameter of a possibly nonhomogeneous pure death process (see [5], 
Ch. XVII). 

If it is assumed that A(t) =A=1/8 for all ¢, a differential equation argument 
shows the probability density for time to failure is given by (1). Thus, the as- 
sumption of an exponential density is equivalent to the assumption of a con- 
stant failure rate. 

The early failure model permits the failure rate to vary with ¢ in the simplest 
possible fashion, namely, 


Ai, 0 Ss t< To, 
A(t) = 


Aa, To S14, 


(5) 


where \;> 2. An analogous differential equation argument shows that the 
probability density corresponding to this failure rate is 
hie dt, 0O<t<T, 
prit; AL, do)dt = a aye T, <1. (6) 
The analysis throughout the remainder of the paper will be in terms of \; 
and 2, although it could just as well be performed in terms of 6:=1/\; and 
62 =1/X2. The choice of \; and \, was prompted by little more than notational 
simplicity. 
The intuitive justification for the early failure model follows from the fact 
that when a component is put on test, it is not known whether it is an “early 
failure” or a “standard” item. Since some will be early failures, the failure rate 
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on the average will be high at the start, but if an item has survived for a certain 
period of time (7'o), then it cannot be an “early failure” so its failure rate will 
be lower for the succeeding time period. 

An alternative model is to assume that the population of components is com- 
posed of two groups with different failure rates, \,; and 2, and the early failure 
group comprises a proportion p of this population. In this model the estimation 
of 1, Xx, and p is complicated unless each failure is examined to determine to 
which sub-population it belongs; for this latter approach, the reader is re- 
ferred to a recent article by W. Mendenhall and R. J. Hader [6]. For some ex- 
perimental programs, however, it is not always possible to examine each failure 
to determine its cause. 

The particular test structure in which the early failure model will be applied 
is the following: N items are put on test and are tested until failure or time 7), 
whichever is sooner, where 7;>7'o. Due to limitations on experimental time, 
no item is tested past 7;. For this censored test structure, let 7* be the time 
at which the test terminates for any particular unit. For 7*=t< 7, the value 
7* =t denotes the failure of the item at time ¢. 7* = 7, denotes the item had not 
failed when the experiment was terminated at 7. The early failure model de- 
termines the probability distribution of T* to be 


pre(t;A1, As)dt = Aye" “dt, O<t<T, 
_ Age To-A2(t—-To) 7 To < t < T, (7) 


P(T* = Ty) = rm, 


In the event that the experimental procedure is to test all items to failure, 
tHe above expressions and all succeeding expressions become the limiting ex- 
pressions as 7; ~. 


3. LARGE SAMPLE THEORY 


If N components or items are tested to failure or time 7, the experimental 
data consist of an ordered sample (4, - - - , tn, 71, ---, 71) wheret:S +--+ St, 
are the ordered failure times (prior to 7;) and the N —n T;’s represent the N —n 
components surviving at 7,. If the experimenter assumes the probability 
density of time to failure is exponential with parameter \=1/0, the maximum 
likelihood estimator of » is 

n 


As —__——— + (8) 
dt + (N — n)T; 
1 


As N-o, 4 will converge in probability to ; ie., for all e>0, 
P{|\—| >e}0 as N-~. However, if the assumption of a constant failure 
rate is incorrect and in fact the early failure model pertains, 4 will converge in 
probability to 

1 = e7*1To—hr2(T1—-To) 
ii, 73° — eee 


satin (1 _ e~ To) + i e~MTo(] ~ e~2(Ti-To)) 
1 Ao 








494 AMERICAN STATISTICAL ASSOCIATION JOURNAL, SEPTEMBER 1960 


(see Appendix). By choosing plausible values of \, \2, and 7'o, the experimenter 
can determine from (9) the magnitude of the error he may be making by not 
assuming the early failure model. 

In the estimation problem for the early failure model it is necessary to dis- 
tinguish two principal cases. 

Case I. T’) is known. 

Let (4, +++, tr, tear, > * +, tn, T1, +++, 71) be an ordered sample of N ob- 
servations whose common distribution is the distribution (7). The observations 
are ordered in the manner 4:5 --- St-<ToStaiiS +--+ Sta<Ti, and the 
final sequence of 7;’s represents the N —n items which survived the test. For 
this sample, the maximum likelihood estimates of \; and dz are 


r 


hi me ) (10) 
D> ts 4- (N a r)To 





w=? 


—_—___—___________ , 


n 


4 
} 
| 
| 


1 do (ti — To) + (N — n)(T1 — To) 


| r+ 


| 
\undefined, r= N. 


The case of primary interest is 0<r<WN since the probability of the comple- 
mentary case tends to zero as N+ «. 

(10) and (11) give point estimates of \; and 2. The exact distribution of 
(Xi, 2) does not have a simple form, and consequently it is not feasible to de- 
termine an exact confidence region for (Ai, \2). However, in the Appendix it is 
shown that as N—>~, the estimators \; and 4, have the following properties: 

(t) 4X: is normally distributed with mean ), and variance 


2 


Ai 


(it) }, is normally distributed with mean ), and variance 


Nest — ett)’ ae 

(itz) Ky and i, are independently distributed. (12727) 

By virtue of (7)—(iiz) for N sufficiently large, an approximate confidence region 

with confidence coefficient 1—a consists of those points (Ai, \2) satisfying the 
following two sets of inequalities: 





N PN —i 1 — en To)1/2 
a a i Re 

At 
VN(h2 - Aa)e~™7 0/2] — e-A2(Ti—To) |1/2 


Az 


«x< 
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where K, is defined by 


—— 1 Ka 
Vl-a=— e722, (14) 
/ 24 —K, 


One-sided confidence regions can be constructed similarly. In each case, it can 
be easily shown that the confidence region is simply connected in the (Au, A2)- 
plane. 

A confidence ellipsoid constructed from the x3 distribution also yields a 
confidence region, but it is harder to determine numerically than the above 
region. 

Case II. 7) is not known exactly, but 7: 79S 7; for known values T; and 7}. 

The jump discontinuity in the failure rate \(¢) at 7’) is somewhat artificial 
and it may therefore be very difficult for the experimenter to designate an exact 
value for 7’. However, he may be able and willing to make the following state- 
ment: If a component or an item fails before T2, it is most likely a dud or early 
failure, but if it survives to time 73, the item cannot reasonably be classified as 
such. Accumulated evidence will very likely indicate values for T; and 73. For 
the estimators of \; and , which are proposed in this section, the exact value 
of 7) is immaterial so long as it lies in the interval [72, 7s]. 

For an ordered sample of N observations 


(ty, Ph » be, bees, lal ” te41, > ton » ta, T1, no oe 7) 
where 


iS --- "9 -*+- S4ETa<har B+ 2 < Ty 


the maximum likelihood estimators (10) and (11) of \; and 2, are no longer 
actual estimators since 7’) is not known exactly. One possibility is to estimate 
simultaneously 7’) by the principle of maximum likelihood or, perhaps, quasi- 
maximum likelihood (see [1]). However, the computations involved in esti- 
mating 7’) are extremely laborious. This, coupled with the possible artificiality 
of To, lends cogency to the following alternative scheme. Disregard the ob- 
servations t,41, - - + , t,in the interval [72, 7;] and estimate \; and \s by 
rT 


Ai = ert ’ (15) 
> t; + (N — r)T 2 





eS = @ 





| = s<QN, 
he = { 2. (t; — Ts) + (N — n)(T, — Ts) (16) 
s+1 


(undefined s=N. 


The case of primary interest is r>0 and s<WN since the probability of the 
complementary case tends to zero as N> ~., 

ii and i, are not maximum likelihood estimators, but they are consistent and 
asymptotically jointly normally distributed. As in the previous case, the exact 
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distribution of (\;, \:) for finite sample size N is not simple, but for large sam- 
ples the asymptotic normality can be employed to obtain an approximate cone 
fidence region for (Ax, \2). Precisely, as N— ~, 


(¢) \, is normally distributed with mean )\, and variance 


2 
Ai 


Na = ery’ 


(iz) }: is normally distributed with mean dz and variance 


2 
A2 


7 Ne-MTs(] dime e—2(T,-T3)) 


(1712) 


(iti) }, and dy are independently distributed. (17722) 


For N sufficiently large, an approximate confidence region with confidence co- 
efficient 2 1 —a consists of those points (Ay, \2) satisfying the following two sets 
of inequalities: 


5 7 7 a i hiT2) 1/2 
- 5.5 So ee 
1 


Nha — dae MTA — ety 
. : : t= a s a (18) 


— 


where K, is defined by (14). 

In this section on large sample theory it has been suggested that for N suffi- 
ciently large, approximate confidence regions may be constructed with the aid 
of the normal distribution. The question which naturally arises is, how large is 
“sufficiently large?” No results have been obtained on how fast the approxima- 
tions of this section approach the true situation. There is an article, however, 
by W. L. Deemer, Jr., and D. F. Votaw, Jr. [3] in which a sampling experiment 
was performed on a censored exponential distribution. The outcome indicates 
that for sample sizes of 100 the analogous approximations are fairly good. 
This, of course, says nothing about the approximations under consideration 
here, but it would be rather surprising if the rates of convergence were quite 
dissimilar. 


4. SMALL SAMPLE THEORY 


The problem of what to do when the sample is small is invariably messier 
and more difficult than the corresponding large sample problem. Also, any solu- 
tions or partial solutions are less satisfying, both mathematically and aesthet- 
ically. Nevertheless, the problem still remains. 

One of the more sensible solutions for the early failure problem is to replace 
the probability density by a multinomial distribution whose probabilities are 
determined by (6) and the intervals delineated by the points 7, 71, T2, and 7’. 

Case I. To is known. 
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Let 
p=P{O<sT<T} =1—e™%, 
P2 = P{  isT< T:} = eTo[] — ea(Mi-To)], and (19) 
D3 = P{T > T} = E—To—2(T1-T 9), 


For an ordered sample (hh, - - - , t, bai, °° +, tn, T1, - + + , T1) of N independ- 
ent observations, the estimates of p:, po, and p; are 


as 


If these estimates are substituted into (19) and any two of the resulting equa- 
tions are solved for \; and do, the estimates 


bs 

—log (1 — px) —loe (; a = 
1= aeairene 9 2= nhs aeoiapeat 5 (20) 
To T; 7: To 
of \; and 2 are obtained. (Note that the estimates of Section 3 are the maxi- 
mum likelihood estimates for every N, but for small N no confidence region 
using them can be constructed conveniently.) 

To construct a confidence region for (Au, Ae), it is sufficient to put a confidence 
region on any two of pi, pe, or ps by using fi, f2, or f; and the trinomial dis- 
tribution, and then solve for those values of (Ai, \2) which give values of p1, pe 
or p; in this region. Unfortunately, the trinomial distribution is not tabulated. 
Two alternatives can be suggested. A confidence interval can be put on any 
one of pi, po, or ps by using the corresponding f,, fe, or £3 and the binomial dis- 
tribution. Or two regions on two different single parameters p1, p2, or p3 can be 
combined according to the rule P{ ANB} =1—P{ A*} —P{ Be}. In this latter 
case, the confidence coefficient is not known exactly but is bounded from below. 

If an exact confidence coefficient 1—a is desired, a reasonable procedure is to 
place the confidence interval on p;. The resulting confidence region for (Ai, dz) 
consists of those values which satisfy 


— log Pa(ps) S To + A2o(Ti — To) S — log palfs), (21) 


where ja(f;) and pa(fs) are the upper and lower 100(1 —a)% confidence limits, 
respectively, for the binomial parameter p(=p;) when the estimate is £(= fs). 

If it suffices to have a confidence region with confidence coefficient 2 1—a, 
then it is preferable to combine the regions on p; and p;. This leads to a con- 
fidence region consisting of those values of (Ai, \2) which simultaneously satisfy 


—he — pth) << ~*s (1 — eld) | 
To i To 
— log p.(ps) S ArT. + A(T: — To) S — log pps), 


im 


where v=a/2. 
Case II. T) is not known exactly, but 72S T) S73. 
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y= P\0 = 7 < T>} = |] — ei", 
P2 = P} Ts < T s T3} = eT s(1 als P); 
ps = P{T;<T < 71} = pe™™[1 — e279], and (23) 


ps = P{T = 7,1} = pete etrTo, 


where p is actually equal to P{T>T7 3|T2=T2} =e>T-T22T-T0, For an 
ordered sample (4, «++, t, tra, +++, te, teat, + °°, ta, Ti, > ++, 11), the esti- 
mates of \; and d, derived from the binomial estimates are 


log (1 — 1) log (1 +5) 
pe 5 de a > Le (24) 


i T, “— T; 





The problem of choosing a confidence region for (Ay, \2) is more difficult than 
that encountered for 7’) known since here the distribution is quadrinomial. The 
simplest and perhaps most reasonable region with confidence coefficient 
= 1—ais the set of all (Ax, Az) satisfying 


— log p(ps) S MT’ + X(T: — T’) S — log pfu) (25) 


for some T’ in [T2, 73]. This confidence region is generated by placing a bi- 
nomial confidence interval on the parameter ju, letting p vary over its natural 
range [exp (—\i(73—T2)), exp (—2(7's—T2)) ], and projecting the resulting 
region onto the (Au, \:)-plane. It has probability 2 1—a of containing the true 
point (Ai, A2) regardless of the value of 7’». 


5. RELIABILITY 


The reliability of a component or item is usually defined to be the probability 
that it will function for a specified length of time, or equivalently, the prob- 
ability it will not fail in this period. Precisely, let R(t) be the probability that an 
item will survive for ¢ units of time. 

In the case where t< 7’) and nothing has been done to eliminate early failures, 
a reasonable estimate of R(t) is exp (—4it) where \, is either given by (10), 
(15), (20), or (24). A confidence interval for R(t) with confidence coefficient 
l1—a is given by (exp (—Ait), exp (—:t)) where \; and A; are the upper and 
lower confidence limits, respectively, on , determined from either (127), 
(172), or pr. 

Because of the reproductive property of the exponential density, a possible 
procedure of eliminating early failures is to pre-test or burn in all components 
past the change point 7. If the failure rate is constant above 7’, all pre-tested 
items will have an exponential density with parameter 2 for time to failure. 
A reasonable estimate of R(t) now is exp (— st), and a confidence interval with 
confidence coefficient 21—a can be constructed by the following procedure. 
Project the confidence region for (x, \2) generated by (13), (18), (21), (22), or 
(25) onto the d»-axis. This will produce two limits, \; and dz, for \: which de- 
termine a confidence interval (exp (— et), exp (—gt)) for R(é). 
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The case in which ¢> 7'y) and the components have not been pre-tested is more 
complex since R(t) involves \y, de, and 7'o. If 7’) is known, exp (—417'o—do(t— T0)) 
estimates R(t) and a confidence interval with confidence coefficient = 1—a is 
constructed by determining the range of values of \y7'9+A2(t—7'o) as (Au, Az) 
varies over its confidence region. If 7’) is unknown, no estimate of R(t) depend- 
ing on the observations only through 4, and }, exists, but (exp (—X(7's), exp 
(—A(T2)) does constitute a confidence interval for R(t) with confidence 2 1—a, 
where 


X(T 2) = bag [\iT2 + Ao(t aad T:)|, X(T's) = bone [yi 7's + Ao(t - T:)], 
1,A2) 1,A2 


and the min(max) are taken over the confidence region on (A, \2). 
6. APPENDIX 
1. AA. 


By definition, }=n/( p t:-+(N—n)T1) where (4, ---, tn, T1,°°-, Ti) 
is an ordered sample of N independent observations on the distribution (6) 
censored at 7. This can be rewritten as 


n N 


—— + —____—- ) 
N N 


Dt 
1 


where (¢,*, - - - , ty*) is an unordered sample of N independent observations . 
on the distribution (7). The random variable n has a binomial distribution 
with parameters N and 


an — p-A1To-r —T 
Pp os 1 é 1To 2(Ty 0), 


and ¢,*, - - - , éy* are independent observations on a distribution with mean 


1 1 
E(T*) = — (1 — e-P) + — e701 — e-(T1-70)), 
r Ae 


1 


Therefore, as N>«,n/N 2,p and >-* t*/N 2, E(T*), so by Slutsky’s theorem 
(see [2]), 12 p/E(T*) =A(Au, 2; To, 71). 
Note the convergence is also almost sure. 


2. Asymptotic normality of (i1, 42); 7’ known. 


The likelihood equation for the ordered sample (th, ---, t, trai, ° °°, tn, 
T1, -- + , T:) of N independent observations on the distribution (6) censored at 
Ti is 


N! 
L = ———_ 
(N — n)! 


Ai’ Aa exp | -as ) ‘| 
1 


-exp| =(n — r)AsTo — 2 . (t;— r%) | 


r+1 


“exp [- N — n)A1T 9 —_ (N _ n)r\(T; —_ To) |. 
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It is readily verified that the estimates ii, \: given by (10), (11) maximize L, 
i.e., \: and dy are the maximum likelihood estimates. 

The proof of the asymptotic normality of (ii, d2) is a routine application of 
the theorem on the asymptotic behavior of maximum likelihood estimators. 
Cramér’s conditions [2] for this theorem are satisfied so ~/W(\i—\1) and 
VN(\.—s) are asymptotically jointly normally distributed with mean (0,0) 
and covariance matrix equal to the inverse of the information matrix whose 
elements are given by: 


0 log Lr 2 To / l 2 
Bl “| 7 f ( a ) Are ‘dt + Ty’e-™, 
Or, 0 Ai 


l 


2 


1 


9 log Lire]? n/1 ’ 
OX2 T, \Az 


+ (Ty — To)%eATa'T-T0), 


or i 


l 
= —e “MT o(] die ew ralT) To)) 
X 2 
° 


A ry Lire O g¢ Lr T 
ab log [ rT ) log I | ~ = ro} f G dv ties 1.) hoe-MTo-Pa( (Todt 
L Or, Ore Lf T, A2 


— (T; — To)e-*™ ninth ~®, 


where Lr« is the likelihood function of the random variable 7*. 
3. Asymptotic normality of (A:, \:); 7’o unknown except for 7:5 T)S7;. 


For an ordered sample (hh, - + - , tr, trai, °° * y bey begat, 6 °° boy Tr + + + Ty) 
of N independent observations on the distribution (6) censored at 7, the pro- 
posed estimator of (Ay, \2) is given by (di, de) of (15) and (16). The theorem on 
the asymptotic normality of maximum likelihood estimators cannot be applied 
directly since 4; and 4, are not maximum likelihood estimators of \; and dg, for 
the distribution (6) censored at 7';. However, it is possible to apply the theorem 
indirectly. 4; and i: do not involve the actual values of t,4:, - - - , é, 80, as far 
as }; and i, are concerned, the sample is (4, - +--+, t, 7, ° °°, 7, lett, °° *% 5 bny 
Ti, - ++, 71), where 7 in the i-th coordinate denotes that the i-th ordered ob- 
servation occurred in the interval [72, 7;]. But this sample can be viewed as a 
sample of N independent observations from the distribution 


Prex(t)dt = de i ‘dt, 0 st< T2, 
P{T* =r} = (1 — p)e™”, 
DP rex (t)dt = pr2e- MT a(t Todt, T; <t< rt 


P} [** _ Ti} = pe-MT2-r2(Ti-Ts) | 
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where 0<p<1. Actually, of course, p is equa! to the value 
p* = P{T* > T,| T* & Ta} = eT eT0), (27) 


If for the moment it is forgotten that p is actually given by (27), then (26) is 
a distribution which depends on three arbitrary, unrelated parameters, Ax, 2, Dp. 
Calculation of the likelihood of the sample readily verifies that (1, d:) is the 
maximum likelihood estimator of (Ax, 2) for this distribution involving three 
unknown parameters di, \2, p. The conditions of the theorem for asymptotic 
normality are satisfied so ~N(\i—\1) and ~W/N(\2—d2) are asymptotically 
jointly normally distributed with mean (0, 0) and covariance matrix equal to 
the inverse of the information matrix whose elements are: 


0 log Lr««x 7 TM /1 2 
ee ed = f (— i :) \ie™™ ‘dt + T27(1 ~ e-™72), 
Or, 0 Ai 


1 —AyT: 
pers 1"), 


alog bre}? ph/l 2 
EK | 10g —| = f (— a (t ire r.)) pr. My Toho (t—-T 3) ft 
Ore T, \A2 


3 
+ (T, es T 3)*pe-MTs2(M1-T a) 
pe-7[1 = ea(Ti-Ts) | 


Ae? 


r I 0 leg Lr 1/1 
“| Te 0 og al — r} f & ne (t - 1%) prse-MTxa( Tat 
Or, OX2 r® \Xe 


Il 








Since, however, the actual value of p is known, there is no harm in using it. 
Thus, the asymptotic variances of }; and }, are 


2 


% = M(L — e™2)-1, a, = dj[pte™(1 — eM-T)]-1, (28) 


The results (28) are not serviceable for constructing confidence regions since p* 
involves the unknown quantity 75. But p*= exp (—Ai(7;—T2)) for 12 >:, 
regardless of the value of 7'o, so if p* is replaced by this bound, an upper bound 
is achieved on the variance of,? which permits the construction of confidence 
regions. The bound exp (—\:(7';—T7:2)) also has the desirable property that it 
is a lower bound for p even if the failure rate \(t) decreases in [T72, T;] other 
than by a jump discontinuity. This means that the results of (17) hold for all 
failure rate models in which X(t) decreases monotonically from ), to dz in the 
interval (72, Ts]. 
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MATHEMATICAL MODELS FOR RANKING FROM PAIRED 
COMPARISONS! 


H. D. Bronx! 
University of Missouri 


Brief mention is made of several models in each of two categories: 
(I) Each possible ranking of items is assumed to have a “utility” which 
depends on the expected scores of the items in paired comparisons. In 
particular, the “worth” of an item may be defined in terms of its ex- 
pected scores in comparisons with others. (II) Each item is assumed 
to have an intrinsic worth; these intrinsic worths determine the ex- 
pected scores. 

A concept, “regularity” is introduced. Under (I), general linear utili- 
ties are discussed, and a necessary and sufficient condition is given in 
order that a linear utility may be regular. 

Under (II), a “minimum assumption” model is introduced. Let 
e(u, v) denote the expected score of an item of worth u when compared 
with one of worth v. The assumption is: e(u, v) is non-decreasing in u, 
non-increasing in v. The problem of estimating expected scores in this 
model is discussed. 


1. INTRODUCTION 


ost models for ranking from paired comparisons appear to fall in one or 
the other of two broad categories. Each of these categories is discussed in 
general, with examples, and a new model is proposed in each. 

The typical situation involves a number of comparisons, each between a pair 
of individuals or items. The result of a single comparison may be a stated prefer- 
ence for one of two foods presented to a judge. It may be the winning of a game 
by one of two players. It may be that a laboratory anim.l responds to one of two 
competing stimuli. In each of these situations, a simple “preference” is ex- 
pressed for one or the other of two “items.” More generally, as a result of a 
comparison a “score” is assigned to each of the two items. In the simplest 
situation, that in which a simple preference is expressed, 1 is scored for the 
preferred item, 0 for the other. The score of an item on a single comparison 
may be thought of in this case as the “number of preferences expressed for the 
item.” In generalizing so as to admit the possibility of a tie, one may give score 
3 to each item in the case of a tie or a draw. In general, if items J; and J; are 
to be compared, we suppose scores 82 (the score of J; in a comparison with J2) 
and sq are to be assigned so that 

82 + 8 = 1. (1) 
(There is thus adopted here a convention different from that of Scheffé [12]; 
if the score by the above convention is denoted by s and that referred to by 
Scheffé (reduced by a constant factor so that the maximum score is 1) by S, 
then S=2s—1. It is noted further that Scheffé admits the possibility that the 
order in which the items are presented to a judge may affect the score. This 
consideration, important in many situations, is neglected here.) 





1 This research was supported by the United States Air Force through the Air Force Office of Scientific Research 
of the Air Research and Development Command under Contract No. AF 18(600)-1108. 
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Each item presented to a judge may have been selected at random from a 
population of similar items. Each judge may have been selected from a popula- 
tion of judges. The state of a player at a particular game might be considered 
as randomly chosen from a population of possible states of that player. These 
remarks are intended to suggest that it is appropriate to consider the score of an 
item as a random variable. One accordingly postulates the existence of an 
underlying probability space. Let there be m items, I), Iz, - - - , Im, to be com- 
pared in pairs. 


Basic assumption 


There is a probability space on which are defined random variables S;;, “the 
score of item I; in a comparison with item I;” (i, 7=1, 2,+-+,m,i¥j) (Al) 
such that Siti =], 


Repeated independent trials of a comparison between specified items may be 
considered on an appropriate Cartesian product space in the usual way. 

Since the range of possible scores will ordinarily be bounded, it is not a 
serious restriction to assume that moments of all orders of the random variables 
S,; exist. In particular, the mean will be denoted by e;;: 


E(Sx;) = ej. (2) 


The m Xm matrix of which the element in the i-th row and j-th column is e;; 
is the matrix of expected scores, E. From (A1) and (2) it follows that the sum of 
an element of this matrix and its image in the principal diagonal is 1: 


(3) 
By convention, é;; 


2. MODELS FOR RANKING 


As a matter of notational convenience, we shall use symbols with superscripts 
when referring to ranked items, symbols with subscripts when referring to un- 
ranked items as originally presented for consideration. Thus Ih, Iz, +--+, Lm 
are the “names” of the m items under consideration; I’, J?, - - - , I™ is a specific 
ranking of them; e* is the expected score of the i-th ranked item when com- 
pared with the j-th, etc. 

Given expected scores for items J,, Jz, - - - , Im in pairs, how are the items 
to be ranked? Approaches to this problem often involve the concept of “worth” 
of an item, in either of two conceptually distinct ways. In either case, the 
“worth” of an item is a real number assigned to that item. 

(t) The worth of an item is defined in terms of its expected scores when com- 
pared with the other items. 

(tt) Each item is asswmed to have an intrinsic worth; these intrinsic worths 
determine the expected scores. 

An assumed intrinsic worth plays a role somewhat analogous to a “main 
effect” in the analysis of variance (this is most clear in Scheffé’s model [12]). 
To assume they determine completely the matrix of expected scores is analogous 
to the hypothesis of no interaction in the analysis of variance. Such models are 
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considered in §3. In the present section is considered a general approach to the 
problem of ranking from given expected scores, including as a special instance 
the approach by definition of worth (7) above. 

The basic assumption for this approach is that if pertinent properties of, and 
relations among, the items were completely known, then it would be possible 
to say of any two proposed rankings that one is more desirable than the other. 
For example, if in contests of a certain kind, player A consistently wins over 
player B who consistently wins over C who in turn wins over A, a committee 
might be able to agree that the ranking ACB (say), in which no player is ranked 
more than one place higher than a player who can beat him, is preferable to 
the ranking ABC. Further, it will be supposed that a numerical scale of desira- 
bility could be introduced. That is, given complete information on the items, 
each ranking would have a numerical “utility” (for the investigator, or for 
some segment of the community) such that the more desirable of two rankings 
would have the greater utility. This assumption is labelled (A2): 


Each ranking of the items has a “value” or “utility”. (A2) 


An optimum ranking is then one of maximum utility. 

Ranking by worths, if they exist, can be subsumed under (A2). For example, 
if w‘ denotes the worth of item J‘ ({=1, 2, - - - , m) in the ranking J’, J?, ---, 
I™, one may take the utility of the ranking to be 1 if w!>w?> --- >w™, and 
otherwise 0. 

We consider first models under (A2) in which the concept of worth does not 
enter. A further assumption relating the expected scores matrix to the utility 
of a ranking is: 


The utility of a ranking I’, I?, - - - , I™ is a function of the expected scores 
e%, 1 <i <j <™m, which is non-decreasing in each argument. 
(Assumptions relating utilities directly to expected scores will be labelled “S”.) 


The utility of a ranking will also be referred to as the utility of the expected 
scores matrix corresponding to that ranking. 


Model A. A suggestion of Kendall [9], that the number of expressed preferences 
which are violated be minimized, motivates the following choice of a utility 
function. 


The utility of aranking I), I*?,---, I™isthesum >, >> e’4, (S2) 


t=1 jmi+1 


In order to make clear the relationship between (S2) and the stated motivation, 
suppose a single comparison is made of each pair of items. Suppose that each 
comparison results either in a tie or in a preference for one of the two items, 
and further that each expected score e“’ is estimated by the observed score s*, 
which is 0, 4, or 1. It is clear that while the estimated e* appearing in the sum 
in definition (S2) depend on the ranking, precisely the same number of scores 
} will appear for all rankings. If r scores 4 appear, then 4m(m—1)—r scores 
appear which are 0’s or 1’s. Each 0 appearing in the sum represents an ex- 
pressed preference which is violated. Since the total number of 0’s and 1’s ap- 
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pearing in the sum is the same for all rankings, a ranking which minimizes vio- 
lated preferences is a ranking for which the (estimated) utility (S2) is maxi- 
mized. (The same conclusion is reached if each tie counts as half a violated prefer- 
ence.) 


Regular utilities. Suppose there are two items J; and J; such that e4<e, for 
k=1,2,--+,m. A utility will be called regular if in each such situation a rank- 
ing in which J; precedes J; has at least as great utility as that in which J; and 
I; are interchanged. While not every utility satisfying (S1) is regular, one read- 
ily verifies that the utility defined by (S2) is regular. 

In the absence of utilities for rankings, a method of ranking by worths will 
be called regular if the utility referred to following (A2) is regular. This is 
equivalent to requiring that ex <e, for k=1, 2, ---,m imply that J; have at 
least as great worth as J;,. 

We turn now to consideration of some models in which worth is determined 
by expected scores, and in which ranking by worth is regular. 


Kendall Model ({9]). In this model the worth of item J; is defined as 


Wi= (DW1) 


j=1 


(Assumptions defining worths will be labelled “DW”.) One may say of this 
model too that, from an appropriate point of view, the number of expressed 
preferences which are violated is minimized. Consider again an experiment in 
which each item is compared once with each other, and the observed scores 
{s,;} are used to estimate the expected scores { e,;}. Suppose further that there 
are no ties, or that a tie is counted as half a violated preference. Then the first 
item in an optimal ranking under (DW1) is that item J; for which the fewest 
violated preferences are represented among the scores s;; (j=1, 2, +--+, m). 
The first item having been determined, the second is that item J, among the 
remaining m— | for which the fewest violated preferences are represented among 
the scores e,;, 7=1, 2,---, m, ete. 

With Model A, that ranking is optimal for which the total number of violated 
preferences is minimized. With the Kendall model, the first item is chosen so 
as to minimize violated preferences, then the second, etc. Thus one might 
consider Model A more appropriate in a situation in which the whole ranking 
is, so to speak, of uniform interest; while the Kendall model would be more 
appropriate in the very common situation in which it is of prime importance 
to choose the highest ranked item correctly, then of secondary importance to 
choose the second, etc. 

If (S2) were to be replaced by “the utility of a ranking J', J?, - - - , J™ is the 
sum >-7, e!,” one should obtain a model in which only the first ranked item is 
considered of importance. If the utility of a ranking J’, J?, - - - , J™ is taken to 
be the sum >» pa e*/, one hasa model in which it is considered of vital im- 
portance that the best two be segregated, but not that a choice be made be- 
tween them, etc. An optimum ranking under the Kendall model (DW1) 
would maximize utility under each of these alternatives to (S2). 
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Linear utility. These alternatives to (S2) are subsumed under a general model 
depending on the following assumption: 

there are non-negative constants c*(i, 7 = 1, 2, +--+, m) such that the 

- ' , a (S3) 

utility of the ranking I', I?, - + -, I” is the sum >> > ctle's, 
i=l jal 

One obtains Model A on setting c’’/=1 for 1<i<j<m, c‘‘=0 otherwise, and 
the two above by setting respectively c‘7=1, c'’=0 for 1>2, and c/#=1 for 
it=1, 2, c’=0 for i>2. The Kendall model optimal ranking will maximize 
utility under (S3) if c‘7=c‘ (k, 7 =1, 2, - - - , m), wherec', c?, - - - ,c” satisfy the 
condition c'>c?> --+- >c”. One might think of the relative value of c‘ as 
specifying the importance attached to correct determination of the i-th ranked 
item. 

Multiplying each c*’ by a constant has the effect of multiplying the utility 
by that constant. Further, since e*’+e?'=1, if ais any number and if, for a par- 
ticular pair (7, 7), c’ and c** are replaced by c‘/+a and c’‘+a respectively, the 
utility is simply increased by a. Suppose M is the largest of the sums c*/+c’, 
Set 


1 
vis = = [ot + H(A — 4 — e%)](G,j = 1,2, - + -,m). (4) 


The utility calculated using the {‘’} in place of the {c*‘} is a linear function 
of the utility using the {c‘#}. The utility 


nm 


od vies 


t=1 j=l 
will be called the normalized utility. One has 
O<sy#<1, vety#=1, (G7 =1,2,--+,m). (5) 
It is shown in the appendix (Theorem 1) that a linear utility is regular if and 
only if after normalization one has 
vy* > y*, yvicry for 1 <j, k=1,2,--++,m. (6) 


Under these conditions the utility satisfies (S1) also (Theorem 2). 


Kendall-Wei Model ({9]). In this model the worth w, of the 7-th item is a func- 
tion of the expected scores e;; (j=1, 2, - +--+, m) satisfying 


hw; = D> e4;w;, $=1,2,---+,m, (DW2) 
j=l 

for some positive constant \. Thus a high expected score when compared with 
a worthy item contributes more to worth than a high expected score when com- 
pared with a less worthy item. It is clear that \ must be a characteristic value, 
and w=(w1, Ws, -- +, Wm) a corresponding characteristic vector, of the ex- 
pected scores matrix. One observes that ranking by worths determined by 
(DW2) is regular. 


Dominance ranking method. G. L. Thompson [13] has studied and generalized 
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the Kendall-Wei method, introducing a “dominance ranking method.” Let D 
denote a matrix identical with the expected scores matrix except that 0’s ap- 
pear on the main diagonal; e a vector each of whose components is 1, and r a 
positive number less than the reciprocal of the principal characteristic value, , 
of F. Then a worth vector w=(w;, We, - - - , Wm) is defined by 


w= (D+rD?+r°D'+ --- de. 


The definition may be extended to the case r=1/, when Thompson finds that 
the dominance ranking method coincides with that of Kendall and Wei. 
Clearly the dominance ranking method is also regular. 


3. INTRINSIC WORTH MODELS 


In such models one assumes a priori that the items have intrinsic worths 
which determine the expected scores matrix. Such worths are analogous to 
“main effects” in the analysis of variance, and the assumption is analogous to 
the assumption of “no interaction.” The following property of intrinsic worth 
seems a desirable one. 


The expected score of an item of worth u when compared with one of 
worth v is a function e(u, v) such that u>v implies e(u, v) >e(v, u), (IW1) 
or, equivalently, e(u, v) >}. 


(Assumptions concerning intrinsic worths will be labelled “JW”.) (Logical 
relations among this assumption and others are studied in ref. [1].) It is shown 
in the appendix (Theorem 3) that a necessary and sufficient condition that rank- 


ing by intrinsic worth be regular is that (7W1) hold. Intrinsic worth satisfying 
(IW1) will be called weakly regular (as contrasted with strongly regular intrinsic 
worth, defined below). 

A stronger assumption is the following. 


The expected score of an item of worth u when compared with an item 
of worth v is a function e(u, v) which is non-decreasing in u, non-> (IW2) 
increasing in v. 


(Note that one of the requirements is redundant: if e(u, v) is non-decreasing in 
u it must be non-increasing in v, and conversely, since e(u, v) +e(v, u) =1.) 

It is shown in the appendix (Theorem 4) that a necessary and sufficient condi- 
tion that ranking by intrinsic worth coincide with the optimal ranking by every 
regular utility is that (7W2) hold. An intrinsic worth satisfying (TW2) will be 
called strongly regular. 

Condition ([W2) appears as Axiom 2 in Appendix I of [10]. 

A “quadruple condition” appears in references [1] and in [8]: ex;>e, implies 
€nj > ix. It is easy to verify that in the presence of (JW1) this condition implies 
(IW2). 

Several models specifying the form of the function e(u, v) satisfying (IW2) 
are mentioned briefly below. In some situations, however, it is not easy to 
justify, a priori, a particular form for e(u, v). For such situations a basic no- 
interaction model is proposed, in which it is assumed only that a function 
e(u, v) satisfying (JW2) exists, without specifying its form a priori. Discussion 
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of this “minimal” model is deferred till after brief consideration of some others. 
Each satisfies the following assumption with a specified function H: 


e(u,v) = H(u — v), 


where H is the distribution function of a symmetric random variable. 
That is, H is anon-decreasing function such that H(— «)=0, H(«)-=1, (1W3) 
and H(—2z)+H(z)=1. 


Condition (IW3) implies (IW2). Theorem 6 in the appendix states that (IW3) 
is essentially stronger if m>3; that is, a situation is exhibited in which (JW2) 
holds but not (JW3). This has also been proved by Block and Marschak [1]. 
Scheffé Model. In the discussion of the Scheffé model, attention is restricted to 
the case in which the order in which items are presented to a judge does not 
affect the score. Making allowance for the difference in scoring method, one 
finds that under the “hypothesis of subtractivity” the Scheffé model is specified 
by taking for H the distribution function of the uniform or rectangular dis- 
tribution over (—}4, 4): 


H(x) =$+z2, —}3<2<}, 
(I[W4) 
e(u,v) = 3+ (u—»). 


When the “hypothesis of subtractivity” fails, Scheffé defines “deviations from 
subtractivity” analogous to interactions: 


vis = ei — [4 + (Wi — Ww), 


where w; and w; are the worths of items J; and J; respectively. 

A property of (IW4) which may be disadvantageous for some purposes is 
that it does not allow for great variations in worth. That is to say that if, for 
example, é2 is near 1, then e3 can not be. For — 4 < min;,;(w; — w,) 
<max;,, ;(w;—w;) <4, and if wj>w.>ws with w,—we near 4} then 0O<w.—w; 
= (w, — W3) — (wi — we) < $ — (Wi — w2) so that we—w; must be near 0 and e3 near 
4. However, in many or most situations the items to be compared will be of 
comparable worth. One might in such situations expect the advantages of the 
linearity of the model and its close relationship to the analysis of variance to 
make it very useful. 

It is interesting to note that intrinsic worth under (JW4) is consistent with 
worth as defined by (DW1). That is to say, if w; represents the worth of J; under 
(IW4) then w; may be chosen so as to be given also, except for a constant multi- 
plier, by (DW1): 

W;i= BI > Cj. 


™ jul 


To see this, compute the right hand member, assuming (JW4): 


12 
— ey = 


+ tb + ti - ail out — = 00. 


1 
M jul ™ jul ™ jul 
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Noting that the probabilities e;; depend only on the differences w;—w;, one 
may normalize by setting 


1 m 
—> (-—w;) =0 
™ jut 
(this is also the normalization used by Scheffé). Then 
1 m 
v= > C ij. 
™ jul 


Bradley-Terry Model {2|. The Bradley-Terry model is based on the intuitively 

appealing idea that the ratio e;;/e;; should be that of the worths w;/w;: 

4;/e5 = w;/W;; or equivalentl 

j/ ej j q mt (1W5) 
ei; = w;/(w; + w)). 


After a simple change of variable, w*=In w, intrinsic worths under (JW5) 
satisfy also (IW3) with [3] 


1 z 
H(z) = _f sech? (y/2)dy. 


Thus 
e(u,v) = u/(u+v) = H(u* — v*) 
where u* =In u, v*=In v. 


Normal Distribution Model.? This model (cf. [11]) satisfies (IW3) with 


1 z 
H(z) = =e} exp (—??/2)dt. (IW7) 


—o 


Assumption (JW3) is a consequence of other assumptions ([7], p. 175) more 
directly related to some situations where the appropriateness of the model 
might be considered. In taste tests, for example, or other situations where a 
subject is presented a stimulus, one might assume that 


the response X of a subject chosen at random from the population of | 

subjects to a stimulus has a distribution with distribution function 

F(x—S), where S is a value associated with the stimulus. His response> (IW8) 
Y to another is independent of X and has distribution function 
F(y—T), where T is a value associated with the other stimulus. 


It is assumed he will prefer that with value T if Y>X. The probability of this 
is readily seen to be H(7’—S), where H is the distribution function of U—V, 
where U and V are independent random variables with distribution function 
F. Thus (JW8) implies (JW3); but not, of course, conversely, since not every 
distribution function H is of the form required by (JW8). 

Mosteller [11] points out that if F is the normal distribution, it is not neces- 





2 Professor Jacob Marschak has kindly informed the writer that this model should be attributed to Fechner: 
Gustav Theodor Fechner, Elemente der Psychophysik, 1859 (cf. especially pp. 70-103 of the 1889 edition). 
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sary to assume that X and Y are independent, only that their correlation is in- 
dependent of S and 7’, in order to arrive at (JW7). 


Basic no-interaction model. This model is based on (JW2) alone: there >xist 
intrinsic worths and a function e(u, v) giving the expected score of an item of 
worth u in comparison with one of worth v, which is non-decreasing in u, non- 
increasing in v. It follows that, given m items, 


there exists a ranking I', I?, -- +, I™ such that e<e*, e*<e* for 
l<i<j<k<m; (IW9) 


namely, ranking by worth. It is shown in the appendix (Theorem 5) that the 
converse is also true: given m items and a ranking satisfying (JW9), there can 
be found worths w!, w?, - - - , w"™ and a function e(u, v) satisfying (JW2) such 
that e(w‘, w’) =e (7, 7=1, 2, ---, m). It is further observed that total scores 
(Kendall model) s‘= pa a e” will serve as worths w’‘, as will also Kendall-Wei 
worths (Corollary 1 to Theorem 5). 


4. ESTIMATION WITH THE BASIC NO-INTERACTION MODEL 


The remarks above have been concerned chiefly with the matrix E of ex- 
pected scores, and little mention has been made of the distributions of the 
scores themselves. If a simple preference is expressed at a single comparison 
between J; and J; (a tie assumed impossible), the score S;; has an elementary 
binomial distribution: S,;;=1 with probability e,;, S;;=0 with probability 
ej,=1—e,;. The general discussion admits of arbitrary distributions, subject 


to the existence of expectations. Let k;; be 0 or a positive integer, and let k;; 
trials be made of the comparison between 7; and J; (i, 7=1, 2,---, m). If 
S,;, is the score of J; in the v-th trial, let 


go ee 
. Ki; mt i 
denote the arithmetical average of the k,; scores. If the distributions of the 
scores depend on certain parameters, among which are the expected scores, one 
may, given a specified ranking, choose the parameters so as to maximize the 
joint density function of the scores, subject to (IW9). The ranking for which 
this maximum value is largest is then the maximum likelihood (M.L.) ranking. 
Suppose it is known that the scores {8} have distributions all belonging to 
the same exponential family (cf. appendix), with means {e‘/}. In particular, 
they may all have the elementary binomial distribution, in which e” is the 
probability that the 7-th ranked item will be preferred to the j-th ranked. 
Given a ranking, and given observed values s‘”, the problem of maximum 
likelihood estimation of the e’’ subject to (JW9) is essentially a special instance 
of the problem solved in ref. [4] (ef. [5]). Consider a square array of points (i,j), 
i running from 1 to m from top to bottom, 7 running from 1 to m from left to 
right, as rows and columns are numbered in the scores matrix. However, the 
points (7, 7) ({=1, 2, - - - , m) are to be missing from this array, as are also all 
points (i, j) for which k‘/=0. In the terminology of [4] an upper interval con- 
sists of a point (7, jo) together with all points (7, 7) such that 7<7, j7>jo. An 
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upper layer is a union of upper intervals. A lower interval consists of a point 
(io, jo) together with all points (7, 7) such that i>%, 7<jo. A lower layer is a 
union of lower intervals. A layer is the intersection of an upper layer with a 
lower layer. The average score over a layer is the sum of the scores S‘ from all 
observations at points (7, /) in the layer divided by the total number of ob- 
servations at these points. 

The solution as given in ref. [5], §5, may be described as follows. Consider the 
upper layer over which the average score is largest. If there are two or more 
upper layers having the same largest average score, so does their union, which 
is the maximal upper layer, U', over which the averuge score is maximal. For each 
point (7, 7) in U', the maximum likelihood estimate (MLE) é* of e* is precisely 
this maximal average score. Consider now those layers which are intersections 
of upper layers with the complement of U'. Among these layers again choose 
the maximal layer, U?, over which the average score is maximal. At each point 
of this layer the MLE é‘ of e* is again this maximal average score. Consider 
next layers which are intersections of upper layers with the complement of 
U'U0?, ete. 

This method of solution is illustrated in the example below, and in the 
appendix, Theorem 7. 

One obtains the same solution starting at the lower left instead of at the 
upper right. That is, one may choose first the maximal lower layer, L', over 
which the average score is minimal. The MLE é* of e*' for each point (7, 7) in 
L’ is precisely the average score over L'. Consider next layers which are inter- 
sections of lower layers with the complement of L’, etc. 

Since S#”+S/”=1 (v=1,2,---, k%;7,7=1, 2, - - - , m, 147), the layers L 
and U' are symmetrically situated with respect to the line i= 7 (the principal 
diagonal) as are also the layers L? and U?, etc. Moreover the sum of the average 
scores over L' and U' is 1, the sum of the average scores over L? and U? is 1, 
ete. Thus @#+é/*=1 (i#j, 1, j7=1, 2, ---, m). If a layer on which the MLE 
is constant contains a position (7, 7) (a diagonal position), the average score 
over that layer must be }. In any event one may, without violating (IW9), 
adopt the convention é*= 3. 

It is true that in ref. [4] (ef. [5]) the random variables S‘” at different points 
(7, 7) are assumed independent, while here S’” = 1—S*”. The solution described 
can be justified, however, on the basis of consideration of observations at 
points above the principal diagonal (i<j). 

For score matrices of moderate size, these calculations are rapid and easy to 
carry out, given an assumed ranking. One can then calculate the value of the 
joint density function for each ranking, and choose that ranking for which the 
joint density function assumes its largest value. 

It is observed, [6], p. 444, that when only order restrictions are made on the 
parameters (expected scores), the MLE’s are independent of the particular 
exponential family to which the distributions of the scores all belong. The order 
restrictions there contemplated are of the form: “e/<e™ and e“*<e” and 

- +.” The present situation is an instance, provided a ranking is specified; 
but before a ranking is specified, the order restrictions on the e;; here are of the 
form: “ej; <ers ANd Cy» Swe aNd +++, OF €gsSe-q and e,,<e% and ---,” and 
are thus of a kind to which the assertion in ref. [6] does not apply. Indeed, it is 
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shown in the appendix (Theorem 7) that the MZ ranking may depend on the 
particular exponential family to which the distributions of all scores belong. 
In particular, the least squares estimates need not coincide with those obtained 
under the assumption that all scores have the elementary binomial distribution. 

The following property of estimation of ranking under the basic no-inter- 
action model, related to regularity, will often make it unnecessary to consider 
the maximum value of the joint density function for all possible rankings. It 
is proved in the appendix (Theorem 8) that if the distributions of all scores 
belong to the same exponential family, if k;,=k>0O (7, 7=1, 2, --+ +, m, i), 
and if for some pair i, 7 we have s,;,><8;, for y=1, 2, - - - , m, then J; precedes I; 
in a ML ranking. As a corollary, if the scores matrix is reducible, items cor- 
responding to the rows distringuished by 0’s rank below the others in a ML 
ranking. (The scores matrix is reducible if there are a positive integer r less 
than m and a ranking such that in the corresponding scores matrix an r X (m—r) 
matrix of 0’s stands in the lower left corner and an (m—r) Xr matrix of 1’s in 
the upper right corner.) 

The following simple example illustrates the method and points up the 
difference between (JW2) and (IW3). Of three items A, B, C let A be preferred 
to B, B to C, and C to A. (Each score is assumed to have the elementary bi- 
nomial distribution.) Maximum likelihood estimates of the worths of A, B and 
C under (IW3) for a specified H (or simply assuming (JW3) satisfied for some 
unspecified function H) would be equal, and each of the six possible rankings 
would be a ML ranking. 

For the ranking ABC, the observed scores matrix is 


A 


B C 
B 


C 


One has k‘/=1 for 7, 7=1, 2, 3, 47. MLE’s under (IW9) for the ranking ABC 
are pan = pac=pac=2/3, Daa=pca=pcr=1/3. The corresponding value of the 
joint frequency function is [(2/3)1(1/3)°]*[(2/3)°(1/3)!]? =2*/3®. For the rank- 
ing ACB 

A C B 


A on U'= ZB 
C U2 = L? 


ee oe 
L! = U3 2 
bac= bea =bces=pac=}, bas=1, Poa =0. The corresponding value of the joint 
frequency function is [(})°(4)!]*[140° ]? =1/2'>2*/3®. Thus ML rankings under 
(IW2) are ACB, CBA, and BAC. In none of these rankings is an item ranked 
more than one place higher than an item which was preferred to it. 


5. MULTIPLE COMPARISONS 


Much of the above discussion can be modified so as to apply to situations 
involving multiple comparisons, Discussion here is limited to brief mention of 
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the adaptation of the basic no-interaction model to a situation where each 
judge is asked to choose the “best” of three items presented to him. Let pi; 
denote the probability that J; will be preferred to both J; and J,. Then 


Dikj = Pie And Pie + Dj + Peig = 1. 
In addition, in the basic no-interaction model, 
Dijk 18 NON-increasing in i, non-decreasing in j and in k. (1W 10) 


The problem of estimating the p‘”* subject to (7W10), given a ranking, is again 
essentially a special instance of the problem solved in ref. [4] (ef. [5]), involving 
a three-dimensional array. 


APPENDIX 


THEOREM 1. A necessary and sufficient condition that a linear utility be regular 
is that after normalization y* >y*, y*'<y*i for i<j, k=1, 2, - - - , m. (Cf. (§3).) 


PROOF OF SUFFICIENCY. The change in the expected scores matrix EF brought 
about by an interchange of two items in a ranking can be thought of as ac- 
complished by successive interchanges of elements of Z: two in a common row, 
two in a common column, or two which are images of each other in the principal 
diagonal. Note that if there are two rows of E such that each element in one 
row is not greater than the element in the same column of the other row, then 
the diagonally opposite elements in these rows satisfy the same inequality (the 
one in the row of smaller elements being not greater than 3, the other not 
smaller than 4). It therefore suffices to consider a single interchange of two ele- 
ments of ZH. Suppose i<j, r>s; then y*<y‘. Suppose e*”>e*. Then ye" 
+-+yitei* > yire*+y*e"". Repeated application of this observation completes the 
proof of the sufficiency of the condition. 


Proor OF NECEssiTy. Assuming the linear utility regular, it is shown first 
that if i<j then y* >’ for r=1, 2, - - - , m. Suppose the contrary, that there 
exist 7, j7, r with i<j such that y"<y*. Since y“+7y"=1 (A, w=1, 2, +--+, m), 
it follows that y"*>-y”. 

Suppose first that r~7, r~7. Consider the expected scores matrix E in which 
e“=4 for \, w=1, 2, ---, m, except that e* =}+h, e*=4—h (0<h<4). Con- 
sider also the expected scores matrix FE, obtained by interchanging items I‘ 
and I’. Every element e in E, is 3}, except e¥ =4+h, e/=4—h. Let U(E), 
U(E;) denote the utilities of the two expected scores matrices. Then U(E;) 
—U(E) =y*h—y"h— (y*h—y"h) =h[(y*— 7") +(v!—y")]>0. Thus EF, has 
greater utility than Z, though each expected score of J‘ is at least as large as 
the corresponding expected score of J’. This contradicts the assumed regularity 
of the utility. 

If r= or r=j one arrives similarly at a contradiction, starting with an ex- 
pected scores matrix Z in which each element is } except that e?/=4$-+A, e* 
=3—h. 

Since” y“+y"=1 (A, w=1, 2,---, m), it follows from the inequality 
yi >" (r=1,2, -- +, m) that also y*<y” (r=1, 2, - + - , m) if i<j. This com- 
pletes the proof of the theorem. 
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THEOREM 2. Under the hypotheses of Theorem 1, the linear utility satisfies (S1). 


Proor. One has 


SSeS ES ov +E YE wt Vwss 
t=1 j=l i=l j=i+l t=1 j=i+l1 i=1 
so that the utility is a non-decreasing function of the e”, 7>7, if and only if 
y>vy*' for 7>7. This condition on the -y“ is cleerly implied by the hypotheses 
of Theorem 1 (set k=). 


THEOREM 3. A necessary and sufficient condition that ranking by intrinsic worth 
be regular is that (IW1) hold. 


PROOF OF NECESSITY. Suppose the contrary, that ranking by worth is regular, 
but that there exist u, v such that uw>v, e(u, v)<}. Let J, have worth u, 
I,, +++, Im have worth v. Then en =}, ¢%<}, k=2, 3, +--+, m, while e,>3, 
€x =}, k=2,---, m. Since u>v», J, ranks above I2, contradicting regularity. 


PROOF OF SUFFICIENCY. Suppose (IW1) holds, and suppose ex >ex, k=1, 2, 
, m. Then a fortiori e;;>e;;=}. That is, e(w;, wj) >}, hence w;>w;. Thus 
the ranking is regular. 


THEOREM 4. A necessary and sufficient condition that the ranking by intrinsic 
worth coincide with the optimal ranking by every regular utility is that e(u, v) 
satisfy (IW2). 


Proor. The sufficiency is evident. To prove the necessity, suppose the con 

trary, that ranking by intrinsic worth coincides with that by every regular 
utility, but that there exist u, v, r such that u>r, e(u, v) <e(r, v). Consider first 
the case in which u>r>v. Let J; have worth u, I, worth r, Is, In, +--+, Im 
each worth v. Then ey =e(u, v) for k=3, 4, ---, m, and ex =e(r, v) >e(u, v) 
=e, for k=3, 4, - - - , m. Choose c* so that c*=1 for k=3, - - - , m, otherwise 
ci=0. One verifies that the utility }>; >>; ce” is regular (Theorem 1), but in 
an optimal ranking by this utility J, precedes J,. The proof for the other cases 
is similar. 

TuHeEoreM 5. If (IW9) holds then there exist worths w', w?, - - - , w™ and a function 
e(u, v) satisfying (IW2) such that e(w', w’) =e" (t, 7=1, 2, - ++, m). 


Proor. Given (JW9), choose positive numbers w!, w?,---, w™ such that 
w'>w*> --- >w™. At points (w', w’), (¢<j, in the u, v plane define e(w‘, w’) 
=e, At other points in the region w!>u>v>w™ define e(u, v) so as to be 
locally planar; that is, so that any triple of adjacent points (in three-space) 
(wi, w’, e%), (wi!, wi, ef 17), (wi, wit!, ef +1) or (wi, w, ef), (wt, wit!, ef s+), 
(wit!, wit!) e'+17+1) determine a triangle which, with its interior, lies in the 
graph of e(u, v). These triangles make up the graph of e(u, v) for w!>u>v>w. 
For w!'>v>u>w” one defines e(u, v) =1—e(v, u), and verifies that e so defined 
is non-decreasing in uw, non-increasing in v. 


Corouuary 1. Kendall worths or Kendall-Wei worths will serve as worths if 
(IW9) holds. 
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It is clear in the proof that any determinations of the worths satisfying 
w'>w?> --- >w™ will serve, the function e(u, v) being defined after worths 
have been so chosen. Kendall worths, w'= >>; e*, satisfy the required inequali- 
ties, given (JW9). So also do Kendall-Wei worths, for if \ is the largest positive 
characteristic value of E and x=(z', -- - , 2™) a corresponding non-negative 
vector, then Azti= ); et -izi < Do; etizi=dzri, 


THEoreM 6. Condition (IW2) (or equivalently (IW9)) does not imply (IW3). 
(This is proved by Block and Marschak [1].) 


Proor. In the following situation (JW9) holds but not (IW3). Set m=4, and 
let e? (i, 7=1, 2, 3, 4) be expected scores such that e!?<e**<e*% <e*<el’<e™, 
Then (IW9) is satisfied. But if there exists a distribution function H such that 
(IW3) holds, then e'?<e** implies H(w!—w*)<H(w?—w*) which implies 
w'—w? <w*—w or 2w?—w'>w!. Similarly w?—w* <w'—w', and 2w*—w?>w’, 
whence w?+w'>w!+w*. But also w?— wt <w!—w', a contradiction. One obtains 
easily a similar counterexample for arbitrary m>3. 

Theorems 7 and 8 require certain properties of exponential families (for 
references cf. [6]). Let Xo be a random variable whose moment-generating func- 
tion exists on an open interval containing the origin (in particular, any bounded 
random variable), and let Fo(z) be its distribution function. Set 


6) = f erdF (2),  @(r) = log (1). 
(--2 00) 


Fis; = f exp [ur — @(r) ]dFo(u) 
(—o 2) 


is the member corresponding to the parameter value r of the exponential family 
of distribution functions determined by F(z). Unless Xo is degenerate, O(r) is 
strictly convex, its derivative 6(r) strictly increasing. If X, has distribution 
function F(x; r), then E(X,)=@(r). Let 7(@) be the inverse function of @(7), 
and define 


T(6) = f ‘e(o)do, 


where 6)9=6(0); 7'(@) is also strictly convex. Suppose that the distributions of 
independent random variables X;,, v=1, 2, ---, ki, i=1,%, +--+, r all belong 
to the same exponential family; that E(X,,) =0,;, v=1,2,--+-,k,i=1,2,---, 
r; that z,, represents an observed value of X,,; and that 


tem 2s Bie 


with respect to an appropriate measure independent of the parameter values 
6;, the logarithm of the joint density function of the random variables X;,, 
at the observed values is given by 


d kif) — O[r(6)]}. 
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The MLE of {6;} therefore minimizes 


> kit T (%,) + O[r(0;) seed E17 (0;) } ° 


Given a ranking and a corresponding observed scores matrix, let L', L?,---, 
L’ be regions in which the MLE of the e* are the constants s', s?, - - - , s* re- 
spectively. Let n” be the total number of observations at a point in L’, 
v=1,2,-+-,r. Then the value of (7) for the MLE’s { é*} of the {e*} is 

Dd ki{ T(s*4) + O[r(€*4)] — s*#r(6*s)}. (8) 
i,j 
From the definitions of the functions involved one has the identity 


T (0) + O[r()] — 6r(6) = O. 


Hence 
@[7(s’)] — s’r(s”) = — T(s’), v=1,2,---,7r. 


Since é7=s” for (7, 7) in L’ and since s’ is the average of the s‘/ over L’, (8) is 
equal to 


>| D kT (8's) - wre’) |, (9) 
veil ijeLr 
THEOREM 7. If the random variables S*” all have distributions belonging to the 
same exponential family, the maximum likelihood ranking under the basic no- 
interaction model may depend on the exponential family. (Cf. Section 4.) 

Proor. An example is given of an observed scores matrix in which one ranking 
is the ML ranking if the S“” all have elementary binomial distributions, while 
another is the ML ranking if the X*” are all normal with a common variance. 
(In the latter case the ML ranking and the least squares ranking coincide, for 
then T(#)+@[r(@) | —#r(@) =(—6)?2/20?, where o? is the common variance.) 
Let h be a small positive number, and let the observed average scores matrix be 
as follows 


LA 


for ranking ABC, k observations having been made on each pair. For ranking 
BAC, one has the average scores matrix 


B A Cc 


B 3+h 
A wigeathe 


C h 
Li 
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Under ABC the MLE’s are éap =épa = 3, ésc= 1—écu4 = 5. égc= 1 —écpz =1—-h. 
Under BAC the MLE’s are éz,= 1 —ésp= $+h, épc =bac= 1 —éca ==] —écp 
=1—h. For the elementary binomial distribution we have 

T(@) = @log @ + (1 — @) log (1 — @) + log 2. 
The value of (9) for ABC is 


k{4(0) + 2[(4 — h) log (} — h) + (4 +) log (3 + h)] — 2[} log 4 + 4 log 3]} 
= 2k[(} — h) log (4 — hk) + (4 +h) log (3 + h) + log 2]. 
For BAC the value of (9) is 
k{2(0) + 2[hlogh + (1 — h) log (1 — h) + 0] 
— 4[(1 — 3h) log (1 — 4h) + 4h log 4h]}. 
The difference is 
2k{ (4 — h) log (4 — h) + (4 +h) log (§ + h) + log 2 — hlog h 
— (1 — h) log (1 — h) + 2[(1 — 4h) log (1 — 4h) + 4h log 4h]} 
= 2k{ (4 — h) log (1 — 2h) + (4 +h) log (1 + 2h) — hlog2 
— (1 — A) log (1 — h) + (2 — h) log (1 — 4h)}. 


The limit as h-0 of this difference divided by h is —log 2<0, hence for suffi- 
ciently small positive h, ABC is the ML ranking. 
If the {S*”} are all normally distributed with variance o?, then for ABC the 
value of (9) is 
k kh? 
ee {2h?} = ’ 
20? a? 
while for BAC the value of (9) is 


= {4@ny'} =, 
20? 20? 
so that BAC is the ML ranking. 

THeoreEM 8. If kj;=k>1 for i¥j,i, 7=1,2, +--+, m,tf sa>S8y,v=1,2,---,m, 
and if all scores have distributions belonging to the same exponential family, then 


I, precedes I, in the ML ranking under the basic no-interaction model. (Cf. Section 
4.) 


Proor. We consider a ranking in which 7,=I', I,=I', i<j, and prove that 
interchanging 7, and J, can not decrease (8). Let e‘/ denote the MLE’s after 
interchange. The minimum value of (8) for the original ranking is therefore not 
greater than 


k >> T(s"*) + O[r(ér)] — srr (er). 


rt 


Also, é">é*", é"'<é7, j=1, 2, - +--+, m. The terms of this sum affected by the 
interchange are, before interchange, 
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k >» T (Sar) + @[7(é7) | - Sart (é'") 


r¥i,j 


+k Do T (sor) + O[r(é**)] — 8-7(é*) 


ring 


+k >» T (Sar) + @[7(é"*)] — 8rq7 (é"*) 


r¥ij 


+k a T (8-5) + @[r(é74)] — 8,7(é"?) 


+ k{T (sas) + O[r(2)] — savr(6e*)} 
+ k{T (Sea) + O[r(é%*)] — ssar(é**)}. 


After interchange they are the same, except that é" and é are to be inter- 
changed, as are é"' and @”, as well as é” and é’*. The difference is 


—k > (sar — sor) [r(8i7) — r(657)] + k(8an — 80a) [7 (24) — r(6%*)] < 0, 


rij 


since r is strictly increasing. Therefore the minimum value of (8) for the original 
ranking is not greater than the minimum value of (8) after interchange. 

The scores matrix is reducible if there are a positive integer r less than m and 
a ranking such that in the corresponding scores matrix an rX(m—r) matrix 
of 0’s stands in the lower left corner and an (m—r) Xr matrix of 1’s in the upper 
right corner. 


Coro.uary. Given a reducible observed scores matrix, the ML ranking of items 
corresponding to the rows distinguished by 0’s is below that of others, if the distribu- 
tions of the scores all belong to the same exponential family, and if k;;=k>1 for 
t%j,1,j=1,2,---,m. 


Proor. If J, is an item corresponding to a row distinguished by 0’s and J, an 
item corresponding to a row not so distinguished, then either s,,=O<s,, or 
So» S 1 = Say. 
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TABLES OF CONFIDENCE LIMITS FOR THE 
BINOMIAL DISTRIBUTION 


JAMES PACHARES 
Hughes Aircraft Company 


Values of p (times 10,000) satisfying the equation 
2 n 
a= © (*)ra-p- 
r=0 r 


are given to four figures for n =55(5)100, x=0(1)n—1 and a=.005, 
.010, .025, .050. 


1, INTRODUCTION 


IVEN that there were x successes in n independent binomial trials, each 

having fixed probability of success, p, find (1—2a) 100% confidence 
limits for p of the form pi<p<p2. Neyman [7] has shown that since z is dis- 
crete the probability of being correct in stating p1< p<» is at least equal to 
(1—2a). If we desire confidence limits of the “central” type, then one would 
find p; by solving equation (1) and p by solving equation (2), where 


" x (") pr(1 — py" (1) 


r=<2 


=, /n 

an > ( pra “er. (2) 
rad \T 

It is only necessary to solve one of these equations, (2) say, for if we replace 

xz by n—z and p by 1—p in (2) the solution will be p;. “Shorter” two-sided con- 

fidence limits for p have been computed by Crow [2], which, however, do not 

give efficient one-sided limits. 

Table I gives to four significant figures the values of p (times 10,000) satis- 
fying (2) for n=55(5)100, x=0(1)n—1, and a=.005, .010, .025, .050. If r=n 
the value of p is always unity and is therefore not tabled. 

If the percentage points of the beta distribution are available, then it is well- 
known that p; and p. may be found from 


a=I1,(z4,n-—2+1) (3) 
a =1;,,(n—2,x+ 1) (4) 


I, (a, b) = fora — t)>'dt/B(a, b). 


The values of p satisfying (2) have been found using the asymptotic series 
in powers of (n—2/2)-* given by equation (5.4) in Wise [9] if n>42+3, and 
using equation (7.1) in Wise [9], but due to Carter, if n<4z+3. Of the 20 
values in Table I that can be checked using (4) without interpolation in Table 
16 of [8], the maximum error found was three units in the fourth significant 
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figure. This occurred at n=65, r=59, a=.05 where use of (4) gave p2.=.9590 
while use of (7.1) gave p2=.9593. This exact check is possible for five qairs of 
(n, x), namely (60, 0), (65, 5), (70, 59), (65, 59), (60, 59). 


2. SOME USES FOR THE TABLE 


A prime contractor receives shipments of parts from many different vendors. 
The number of defectives, z, in a random sample of size n is noted and con- 
fidence limits for, p, the per cent defective in the population are determined 
assuming that zx follows a binomial distribution. Usually one-sided upper limits 
are desired of the type p< pe, and if p. turns out to be greater than some pre- 
determined value, the vendor involved may either be put on a “probation” 
list or dropped entirely. It is important in vendor rating and quality control 
problems to have tables of p. to facilitate the making of decisions. The tabu- 
lated values are of use in many other situations such as those mentioned in 
references [1], [3], [5]. 

Use of the table in finding p,; does not guarantee four significant figures in all 
cases. Suppose, for example, that n=55, c=5 and a=.05. We find from the 
table that pz=.1817 and replacing x by n—z we find that p; = 1 —.9638 = .0362. 
Here p; is found to only three significant figures. The probability of being cor- 
rect in stating .0362<p<.1817 is then at least .90. 


3. INTERPOLATION IN THE TABLE 


In quality control work samples of five are quite common so that as data is 
accumulated the total sample size at any stage would be a multiple of five. 
This was the main reason for using a step size of five in n, in which case no 
interpolation would be needed. If interpolation with respect to n is desired, it 
turns out that linear interpolation in p is surprisingly good, yielding three sig- 
nificant figures in several cases tested. Linear interpolation in log p with respect 
to n was even better, yielding a maximum error of three in the fourth significant 
figure; however, this is somewhat inconvenient requiring use of logarithm and 
exponential tables. Use of five-point Lagrangian interpolation coefficients as 
tabulated in [6] yielded full accuracy of interpolates in several cases tested and 
is the recommended interpolation scheme. 


4. RELATED YABLES 


Mainland [5] lists to either two or three figures values of p satisfying equa- 
tions (1) and (2) for a =.10, .025, .005 and for the following values of x and n: 
Tables IA and IB x=0(1)20 and selected values of n up to 1000 in varying 
step sizes; Table II the percentage of x’s in the sample =.1(.2).7, 1(1)3, 5, 7.5, 
10(5)50 and selected values of » up to 100,000 in varying step sizes. 

Grubbs [3] gives to three significant figures the values of p satisfying 
.95=I_,(n—c, e+1) (his Table I), and .10=J,;_,(n—c, c+1) (his Table IT) 
for n= 1(1)150 and c=0(1)9. His Table II represents an extension of our Table 
I. It is seen from his Table I that there is an overlap in our tables when a=.05 
for rc=n—10, n—9,---, n—1. On comparing the points of overlap, one 
must remember to replace his c with n—x—1 and to subtract his tabled 
entry from unity. Grubbs [3, p. 246] states “... tables... for which 
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I,_,(n—c, c+1) =.99, .95, .90, .10, .05, .01 etc. would be of considerable prac- 
tical use.” Our Table I fulfills in part at least this requirement. 

Clark [1] tables to four significant figures the values of p satisfying (1) for 
n=10(1)50, x=1(1)n and a=.005, .010, .025, .050. 

Hald [4] gives to three decimals 95% and 99% two-sided confidence limits 
for p for selected combinations of n and z. 

Pearson and Hartley [8] give in their Table 16, to five significant figures, 
values of p for which a=1,(v2/2, 1/2) for a=.50, .25, .10, .05, .025, .01, .005 
and for selected combinations of »; and v.. We can solve (1) for p,; by putting 
y=2(n—2z+1), »=22. We can solve (2) for p~: by putting »=2(2+1), 
v= 2(n—2) and subtracting the tabled entry from unity. 

The equation a=/,(a, b) can be solved graphically for either a or p to ap- 
proximately two significant figures in the region b<60, a<200 using the charts 
of Table 17 in [8] when a>b. If a<b we use the relation J,(a, b) =1—I;_p(b, a). 
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TABLE I. VALUES OF p (TIMES 10,000) SUCH THAT 


a=> (")ra-p 


r= 


n=55 




















6,265 
6,438 
6,609 
6,779 
6,948 
7,115 
7,280 
7,444 
7,606 
7,767 
7,926 
8,083 
8,238 
8,392 
8,543 
8,691 
8,837 
8,981 
9,121 
9,257 
9,389 
9,517 
9,638 
9,751 
9,853 
9,939 
9,993 
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TABLE I—(cont.) 
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TABLE I—(cont.) 








.005 








8,162 
8,286 
8,408 
8,528 
8,646 
8,762 
8,876 
8,987 
9,096 
9,202 9,127 
9,305 9,235 
9,404 9,340 
9,500 9,442 
9,592 9,540 
9,678 9,633 
5,668 ¢ ¢ 9,759 9,720 
5,819 9,832 9,801 
5,968 ¢ : 9,897 9,874 
6,116 ¢ 9,950 9,935 
6 263 ¢ f 9,986 9,979 
6,408 ‘ 10,000 9,999 
6,552 
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TABLE I—(cont.) 
n=70 
— 











729.0 
1,014 
1,258 
1,481 
1,692 
1,893 
2,087 
2,275 
2,458 
2 ,637 
2,813 
2,985 
3,155 
3,322 
3,486 
3,649 
3,809 
3,967 
4,124 
4,279 


@S2NOunrhwndrK © 


4,734 
4,882 
5,030 
5,176 
5,320 
5,463 
5,605 
5,746 
5,886 
6 ,024 
6,161 6,018 
6,297 6,155 
6 ,432 6 ,292 
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TABLE I—(cont.) 









































n=75 

we 2 Pa 
005 010 025 050 4 005 .010 025 .050 

z Zz \ 

 __—a a ela \ 
0 | 68.1 595.6 480.0 391.6 | | 38 6.575 6,441 6,242 6,067 
1 949.3 852.6 720.6 616.9 | | 39 6,699 6,566 6,369 6,196 
2 1,178 1,074 930.3 815.7 | | 40 6,821 6,691 6,495 6,324 
| 3 1,388 1,278 1,125 1,001 | | 41 6,943 6,814 6,621 6,451 
| 4 1,585 1,471 1,310 1,179 | 42 7,064 6,936 6,745 6,578 
5 1,774 1,655 1,488 1,351 | 43 7,183 7,058 6,869 6,704 
| 6 1,957 1,834 1,660 1,518 | 44 7,302 7,178 6,993 6,829 
7 2,134 2,008 1,829 1,682 | 45 7,419 7,298 7,115 6,954 
| 8 2,306 2,177 1,994 1,842 46 7,536 7,416 7,237 7,078 
9 2,475 2,343 2,156 2,001 47 7,651 7,534 7,357 7,201 
10 2,640 2,507 2,316 2,157 | 48 7,765 7,651 7,472 7,938 
11 2,803 2,667 2,473 2,311 | | 49 7,878 7,766 7,596 7,445 
12 2,963 2,825 2,628 2,463 50 7,990 7,880 7,714 7,566 
13 3,120 2,981 2,781 2,614 51 | 8,101 7,994 7,831 7,686 
| 14 3,275 3,135 2,933 2,763 52 | 8,210 8,106 7,947 7,805 
15 3,429 3,287 3,083 2,911 | 53 8,318 8,217 8,062 7,924 
| 16 3,580 3,438 3,232 3,058 | 54 8,425 8,327 8,177 8,041 
17 3,730 3,587 3,379 3,204 | | 55 8,531 8,435 8,290 8,158 
18 3,878 3,734 3,525 3,348 56 8,634 8,543 8,401 8,274 
19 4,024 3,880 3,670 3,492 57 | 8,737 8,649 8,512 8,388 
20 4,169 4,024 3,814 3,635 | | 58 8,838 8,753 8,621 8,502 
21 4,313 4,168 3,956 3,776 | | 59 | 8,937 8,856 8,729 8,614 
22 4,455 4,310 4,098 3,917 | 60 9,034 8,957 8,836 8,726 
23 4,596 4,451 4,238 4,057 | 61 9,129 9,056 8,941 8 , 836 
24 4,736 4,590 4,378 4,196 | 62 9,222 9,153 9,045 8,944 
25 4,874 4,729 4,517 4,334 | 63 9,313 9,248 9,146 9,051 
26 5,011 4,867 4,654 4,472 | 64 9,401 9,341 9,246 9,156 
27 5,147 5,003 4,791 4,609 65 9,487 9,432 9,343 9,260 
28 5,283 5,139 4,927 4,745 66 9,569 9,519 9,438 9,361 
29 5,416 5,273 5,062 4,880 | 67 9,648 9,603 9,530 9,460 
30 5,549 5,407 5,196 5,015 | 68 9,722 9,683 9,618 9,555 
31 5,681 5,539 5,330 5,149 | | 69 9,792 9,758 9,703 9,648 
32 5,812 5,671 5,462 5,282 | | 70 9,855 9,828 9,782 9,736 
| 33 5,942 5,802 5,594 5,414 | | 71 9,911 9,891 9,855 9,818 
| 34 6,070 5,931 5,725 5,546 72 9,956 9,943 9,919 9,893 
| 35 6,198 6,060 5,855 5,678 73 9,988 9,982 9,970 9,955 
| 36 6,325 6,188 5,985 5,808 74 | 10,000 9,999 9,998 9,995 


37 | 6,450 6,315 6,114 5,938 
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TABLE I—(cont.) 
n=80 
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TABLE I—(cont.) 
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1,161 
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4,520 

4,641 
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4,882 

5,001 
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5,552 5,355 ¢ 

5,668 5,472 5,303 | 9,808 

5,784 5,589 5,420 } 9,872 

5,898 5,704 5,536 9,950 9,929 
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TABLE I—(cont.) 





























n=90 
a a 

.005 .010 .025 .050 .005 .010 .025 .050 

| 2 x 
0 571.7 498.8 401.6 327.4 45 6 ,382 6,258 6,073 5,912 
1 796.7 714.9 603 .6 516.3 | 46 6 ,487 6 ,364 6,180 6,020 
2 989.7 901.4 779.8 683 .0 47 6,591 6,469 6,287 6,128 
3 1,167 1,073 943.4 839.0 48 6 ,694 6,573 6 ,393 6,235 
j 4 1,334 1,236 1,099 988 .2 49 6,797 6 ,677 6,499 6,342 
5 1,494 1,392 1,249 1,133 50 6,899 6,781 6 ,604 6,449 
6 1,648 1,543 1,395 1,273 | 51 7,001 6 , 884 6,708 6,555 
: 1,799 1,690 1,537 1,411 | 52 7,102 6,986 6,812 6 ,660 
8 1,945 1,834 1,677 1,547 53 7,202 7,088 6,916 6,765 
2,089 1,975 1,814 1,680 54 7,301 7,189 7,019 6,870 
10 2,229 2,114 1,949 1,812 | 55 7,400 7,289 7,122 6,974 
11 2,368 2,250 2,082 1,942 56 7,498 7,389 7,224 7,078 
12 2,504 2,384 2,213 2,071 57 7,596 7,488 7,325 7,181 
13 2,638 2,517 2,343 2,198 58 7,693 7,586 7,426 7,284 
14 2,771 2,648 2,472 2,324 | 59 7,789 7,684 7,526 7,386 
15 2,902 2,778 2,600 2,450 60 7,884 7,781 7,626 7,488 
16 3,032 2,907 2,726 2,574 | 61 7,978 7,878 7,725 7,590 
17 3,160 3,034 2,851 2,698 62 8,072 7,973 7,824 7,691 
18 3,287 3,160 2,975 2,820 | 63 8,165 8,068 7,922 7,791 
19 3,413 3,285 3,099 2,942 64 8,257 8,162 8,019 7,891 
20 3,538 3,408 3,221 3,063 | 65 8,348 8,256 8,115 7,990 
21 3,661 3,531 3,343 3,183 66 8,438 8,348 8,211 8,088 
22 3,784 3,653 3,464 3,303 67 8 ,527 8,440 8,306 8,186 
23 3,905 3,774 3,584 3,422 68 8,616 8,531 8,401 8,284 
24 4,026 3,894 3,703 3,541 69 8,703 8,621 8,494 8,381 
25 4,146 4,014 3,822 3,659 | 70 8,789 8,710 8 ,587 8,477 
26 4,264 4,132 3,940 3,776 | 71 8,874 8,798 8,679 8,572 
27 4,382 4,250 4,057 3,893 72 8,958 8,884 8,770 8 ,666 
28 4,500 4,367 4,174 4,009 73 9,041 8,970 8 ,860 8,760 
29 4,616 4,484 4,290 4,124 74 9,122 9,055 8,949 8 ,853 
30 4,732 4,599 4,406 4,240 75 9,202 9,138 9 ,037 8,944 
31 4,847 4,714 4,520 4,354 | 76 9,280 9,219 9,124 9,035 
32 4,961 4,829 4,635 4,468 | 77 9 ,357 9,300 9,209 9,125 
33 5,074 4,942 4,749 4,582 | 78 9,432 9,378 9,293 9,213 
34 5,187 5,055 4,862 4,695 | 79 9,505 9,455 9,375 9,300 
35 5,299 5,168 4,974 4,808 | | 80 9,575 9,529 9,455 9,386 
36 5,410 5,279 5,087 4,920 81 9 ,643 9,601 9,534 9,470 
37 5,521 5,390 5,198 5,032 | 82 9,708 9,671 9,610 9,551 
38 5,631 5,501 5,309 5,144 83 9,770 9,737 9 ,683 9,631 
39 5,740 5,611 5,420 5,255 | 84 9,827 9,799 9,753 9,707 
40 5,849 5,720 5,530 5,366 85 9,880 9,857 9,819 9,780 
41 5,957 5,829 5,640 5,476 86 9,926 9,909 9,880 9,849 
42 6 ,064 5,937 5,749 5,586 | | 87 9 ,964 9,953 9 ,933 9,911 
} 43 6,171 6,044 5,857 5,695 | 88 9,990 9,985 9,975 9,963 
| 44 6,277 6,151 5,965 5,804 | | 89 | 10,000 10,000 9,998 9,995 
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TABLE I—(cont.) 
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9,185 
9,262 
9 ,338 
P | 9,412 
4,843 ¢ ‘ 9,484 § 
4,950 , ¢ 9,555 ¢ 9,419 

5,056 88 9, 9,623 ¢ 9,498 
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TABLE I—(cont.) 
n=100 
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A METHOD OF ANALYZING LOG-NORMALLY DISTRIBUTED 
SURVIVAL DATA WITH INCOMPLETE FOLLOW-UP* 


MANNING FEINLEIB 
State University of New York 


A method is presented for analyzing survival data with incomplete 
follow-up when there is reason to suppose that the data have a trans- 
lated log-normal distribution. The life table method is used to make 
allowances for those patients lost to observation and for tabulating 
the resulting survival frequencies. Modified maximum likelihood equa- 
tions and short-cut procedures are presented for calculating the 
parameters of the general three parameter log-normal distribution from 
grouped data as represented by the life table. 


1, INTRODUCTION 


ANY variables encountered in practice have distributions which can be 
more or less closely approximated by theoretical frequency curves. When 
the approximation is known to be close, the theoretical frequency curve pro- 
vides a compact way of summarizing the data in terms of a few parameters and 
provides a smooth curve for describing the observed distribution. Furthermore, 
parametric distributions lend themselves to methods of testing for differences 
between populations which are usually more powerful than corresponding non- 
parametric techniques and, in some instances, the mathematical distribution 
may help to elucidate the nature of the phenomenon under investigation. 
Several investigators (e.g. Boag [1], Lea [7], Osgood [11, 12, 13], and Tivey 
[13, 14, 15]) have observed that the distributions of duration of survival in 
several diseases can be rather closely approximated by a log-normal distribu- 
tion. They have pointed out that although the frequency distributions of sur- 
vival times in these diseases are markedly skewed to the right, the logarithms 
of the survival times are approximately normally distributed. Thus, if one were 
to plot the cumulative distribution of the survival times of patients with these 
diseases on log-normal graph paper (Codex Book Co., Inc., No. 31376) a 
straight line should be obtained. An inspection of the graphs presented by 
these authors often showed, however, that the observed distribution was 
markedly convex with respect to the predicted straight line. A similar disparity 
was observed in a recent study [4] of survival of patients suffering from chronic 
leukemia (Fig. 1). 
Gaddum [5] has shown that such deviations can be corrected by subtracting 
an appropriate constant from the survival times. This transformation results 
in a three-parameter or translated log-normal distribution. That is, the variable 


t defined by 
1 
t=—1 1 
: og ( ; ) (1) 


is a standard normal variate with mean zero and variance one, where z is the 





* This study was aided in part by a field investigation grant from the National Cancer Institute, National 
Institutes of Health, United States Public Health Service. 
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LOG-NORMAL SURVIVAL DATA 





CLL M 
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Fie. 1. Duration of survival from date of diagnosis of 234 patients with chronic lympho- 
cytic leukemia. Duration of survival in months is plotted along the logarithmic scale. 
The straight line was calculated by Lea’s method [7]. 


duration of survival measured from date of diagnosis, onset of symptoms, or 
some other point in the course of the disease, a is the appropriate constant, and 
log b and ¢ are the mean and standard deviation of the variable log (x—a), 
respectively. 

To date no method has been proposed for fitting a translated log-normal 
distribution to incomplete data, although maximum likelihood solutions for 
complete data have been provided by Cohen [2]. Modifications of Cohen’s 
equations applicable to life data, as employed in a recent study of survival in 
chronic leukemia [4], are presented below. It has not yet been ascertained 
whether the values obtained by this modified maximum likelihood procedure 
are in complete agreement with the exact maximum likelihood solution for the 
case of incomplete data. Since the latter procedure is not yet available, the 
method proposed below must rest on heuristic grounds. It may be noted, how- 
ever, that if the survival data are complete, the procedure proposed below is 
strictly equivalent to the maximum likelihood solution for grouped data. 


2. VALIDITY AND INTERPRETATION OF THE LOG-NORMAL DISTRIBUTION 


Before proceeding to the method of estimating the parameters of the trans- 
lated log-normal distribution some non-mathematical problems of interpreta- 
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Fic. 2. Duration of survival from date of diagnosis of 234 male patients with chronic 
lymphocytic leukemia. Duration of survival in months plus 4.0 is plotted along the 
logarithmic scale. The straight line was calculated by the method described in the text. 


tion and validity should first be considered. The first problem is the validity 
of assuming that survival data follow a translated log-normal distribution. 
This problem can be resolved only in terms of empirical observations. The 
authors noted above have shown that the logarithms of survival times are ap- 
proximately normally distributed. Fig. 2 shows that the subtraction of an ap- 
propriate constant may improve the approximation. Hence, empirically it 
seems valid to assume that the translated log-normal distribution is appropriate. 

The second consideration is whether the use of a third parameter, the con- 
stant to be subtracted, appreciably improves the goodness-of-fit. A comparison 
of Fig. 1 with Fig. 2 shows that the improvement is appreciable. In fact, an ex- 
ceedingly poor fit (P is approximately .001) is changed to a good fit (P is ap- 
proximately .66). Thus, it seems justifiable, in certain cases, to use the more 
general three-parameter distribution in preference to the two-parameter dis- 
tribution (i.e. where a is assumed a priori to be zero). 

The third problem is the interpretation to be given to the three parameters 
and the logarithmic transformation. Any venture at a biological interpretation - 
of a mathematical phenomenon is hazardous. In general, however, a translated 
log-normal distribution might be interpreted as meaning that a multitude of 
factors affect the variable under consideration, that the effect of each factor de- 
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pends upon the accumulative effect of the factors which have preceded it, and 
that the factors apparently began to act at the point “a” in the original time 
scale. The point “a”, however, is not necessarily the time of onset of the 
disease, nor need it be any other clinically recognizable point in the natural 
history of the disease. It is strictly speaking nothing more than a mathematical 
estimation of a parameter which enables us to fit a theoretical distribution to 
an empirical one. If one desires to interpret it biologically, it might be described 
as the apparent origin in time at which the cumulative factors which lead to 
death began to act. It should be emphasized again, that these interpretations 
may have no real basis in observable facts. They are, however, in agreement 
with some theoretical models that have been suggested for the etiology of neo- 
plastic disorders [10]. 


3. MAXIMUM LIKELIHOOD EQUATIONS 


Under the condition that log (x—a) is normally distributed, the density 
function of x would be given by 


Br. exp| tos? (“—*) | if 2>e 
f(z) = je6 a) ie 2c? b (2) 
0 if x<a 





If the duration of survival of each patient were known, the parameters of 
this distribution could be obtained from the maximum likelihood estimators 
described by Cohen [2]. He has found that the maximum likelihood estimates 
of log b and c? are given by 


1 n 
log 6 = — > log (a, — 4) (3) 
nN int 
and 
1 n 1 n 2 
— } log? (2; — a) — (— > log (x; — ‘)) (4) 
nN imi i=1 
while the maximum likelihood estimate of a is the solution, 4, of the equation 


(a) = | > — lle 2 log (x; — a) —n > log? (x; — a) 


i=) Li — i=1 
+ (Live ce: - o) Jw were © a (5) 


where n is the number of patients observed and 2; is the duration of survival of 
the ith patient. \(a)=0 may be solved for @ by an iterative procedure and 
the solution subsequently substituted in the preceding equations to obtain the 
estimates log 6 and @. 


4. ADJUSTMENT FOR INCOMPLETE FOLLOW-UP 


In human survival studies, however, the duration of survival is generally not 
known for each individual. Some patients may still be alive at the termination 
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of the study, while others may withdraw from the study prior to their deaths 
and be lost to follow-up. For such patients the value of 2; would be unknown. 
One method of making allowances for these patients is the actuarial or life 
table method. The life table method is based on the calculation of conditional 
probabilities of dying during short intervals of the period under consideration. 
A commonly used expression for calculating the probabilities of dying during 
any interval, is known as the “}-rule” and is given by the following equation :* 
D; 6) 
y dete 
Harris, Meier, and Tukey [6] have pointed out “that this ‘}-rule’ is not an 
assumption, but the natural, mathematical, first approximation to maximum 
likelihood in a situation where even a second approximation is rarely needed.” 
Having calculated the values of g; for each interval, these are then applied 
to a hypothetical number of individuals, 1,, alive at the beginning of the first 
interval, e.g. 1,000, to yield 1;, the number of individuals of the original 1,000 
who are alive at the beginning of the 7th interval, and d,, the number of deaths 
that occur during the 7th interval, for 7 ranging from 1 to p. The values of the 
l; and the d; may be obtained from the following relations: 


d; = qi li (7) 
Liga - l; *~ d; (8) 


The adjusted survival curve of a group of patients may next be obtained by 


plotting the proportion who have died as a function of time. This is done by 
plotting the cumulative sum of d;/l,; in the life table against the endpoint of 
the last interval in each sum. 


5. ESTIMATION OF THE PARAMETERS 


An intuitive approach toward estimating the parameters of the log-normal 
distribution would be to treat the numbers dying in each interval as calculated 
in the life table, the dj, as if they had arisen from an ordinary random sample. 
This was the method adopted in the study on chronic leukemia mentioned pre- 
viously. Before substituting the values obtained from the life table into the 
usual maximum likelihood equations, however, certain adjustments must be 
made for the grouping. 

When using grouped data, the assumption is usually made that all the values 
within any small interval can be considered to be concentrated at the mean 





* The usual life table notation has been slightly modified to allow for summation over the index for intervals 
rather than directly over time. The following notation is employed: 
ye—the beginning of the ith interval. 
L;—the number of individuals alive and under observation at the beginning of the ith interval, i.e. at time y;. 
D,;—the number of individuals known to have died during the ith interval. 
W,—the number of individuals withdrawing during the ith interval. 
qi—the conditional probability of dying during the ith interval, given survival to y;. 

If the number of intervals in the life table is p, the initial point of the time scale, y:, is set equal to the (unknown) 
parameter a, yz is chosen to be positive and larger than a, and the upper endpoint of the last interval, Up+4, is set 
equal to plus infinitity, while the remaining p —2 points are placed at any convenient distances between y: and Up 
Since the value of a is not known at the outset of the calculations, 7: may be taken to be the origin from which 
survival is measured and set equal to zero, 
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of the portion of the distribution over that interval. That is, in terms of the 
transformed variable, t, it is desired to find the point t,* for each interval such 
that 





tin | tin =] 
t,* — Ss —H/2 dt = f a te"! dt (9) 
t; V2 ty V2r 
where 
1 r= Ss 
t; = — log Y (10) 
c b 


For all intervals but the first and the last, the usual assumption may be made 
that the desired point is the midpoint of the transformed interval, i.e. 


t* = $(ti + bigs) t= 2,3,---,p—1 (11) 


The point in each interval on the original time scale which is mapped into the 
t;* is given by the inverse relationship 





1 yi* —a 
t? = log ( (12) 
c b 
Substituting (12) in (11), the following simplification is obtained 
log (yi* — a) = p(log (yi — a) + log(yi41-—@)) t= 2,3,---,p—1 (13) 
For the first and last intervals Eqns. (11) and (13) do not hold since y,=a, 
Yoti= ©, i= — ~, and t,,1= ©. Estimates of the means of these intervals can 
be made from the data in the life table by making the assumptions that 
dy * 3 , 
a f edt (14) 
Ly; —0 Vv T 
and 
d one | 
=f —ae "dt (15) 
h tp V2r 
Equation (9) can then be solved explicitly to yield 
Lp ew 
dy, / 294 
and 
L fe ty?/2 
t += — | 17 
Yel oe (17) 


where f, and ¢, are determined from (14) and (15), respectively. 

It is to be observed that the values of y,;* ({=2, 3, - - - , p—1) as determined 
by Eqn. (13) are independent of b and c, and depend only on a. The values of 
y:* and y,*, however, obtained by substituting the values of 4:* and ¢,* found 
from Eqns. (16) and (17) into Eqn. (12) depend on all three parameters. 

A method of determining y,* and y,* which is independent of the parameter 
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b is provided by the relations 

log (yi* — a) = log (yz — a) — c(t, — t:*) (18) 
and 

log (yp* — a) = log (yp — a) + c(t,* — ty) (19) 


where fs, t,, f:*, and t,* are determined by Equations (14) through (17), re- 
spectively. Initial estimates of the parameters a, b, and c may be made by short- 
cut methods to be discussed below. 

The above relations provide convenient estimates for the y,* to be used in 
the following modifications of Cohen’s maximum likelihood equations which 
are to be used in the estimation of the parameters of the log-normal distribution 
from the grouped data of the life table: 


1 2 
log 6 = — }> d, log (y; — 4) (20) 


1 t=l 

le 12 : 

= > d; log? (y* — 4) — l > d; log (y* — a)| (21) 
1 i=l 1 i=l 


Pp 


d; 4 
| bie ams | E > dj log (y* — a) — Ly DY dj log? (y.* — a) 


t=1 t=1 


P d; 1 ( * = ¢ 
2g dilogy*—@) | 


Pp 2 
> d; log (y.* — .)) | — 1; 


i=1 yi* —a 


t=1 
6. SHORT-CUT PROCEDURES 


In order to find a first approximation for a to be used in Eqn. (22), there is 
available a short-cut procedure which was first described by Gaddum [5]. 
Gaddum noted that if z:, z2, 23 are three values from a log-normal distribution 
which are equidistant on the normal deviate scale then z2—a is the geometric 
mean of z;—a and z3;—a. He has, therefore, suggested the following simple rela- 
tionship for estimating a 


95 
21 + 23 — 222 _ 
where a* denotes the estimate of a. 

The values of 21, z2, and z; may easily be obtained by plotting the cumulative 
distribution of the survival times, }°d;/l:, on the log-normal graph paper dis- 
cussed in the introduction. Then, taking zz to be the median of the sample dis- 
tribution, the values on the time scale corresponding to any two noints on the 
sample distribution which are a given distance, h, above and below the median 
on the normal deviate scale will be z; and 21, respectively. 

After a* has been determined, c*, the initial estimate of c to be used in 
Eqns. (18) and (19), may be found from the following relation: 

‘ _ log (zs — a*) — log (2; — a*) 


asieneatiainagiedipeagaiedt — 24 
2h ) 
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where 2h is the distance between the points corresponding to z; and z3 on the 
normal deviate scale. 
If desired, an initial estimate of b, b*, may be easily obtained from the relation 


b* = 22> a* (25) 


For known a, and for complete data the estimate c* is most efficient when h 
is equal to 1.476, so that z; is equal to the 7th percentile, and z; is equal to the 
93rd percentile of the sample distribution. The efficiency [3, pp. 77, 265] of this 
estimate is .65. The efficiency of the estimate b* for known a is .637. The 
efficiency of the estimator a* is not known. 


7. RECAPITULATION 


To recapitulate, the steps in the procedure discussed above for estimating 
the parameters a, b, and c, from the life table data are as follows: 


. Calculate the d; using Eqns. (6), (7), and (8). 

. Plot }¢d,/l; on log-normal] graph paper. 

3. Determine the median ze, and, choosing h as close to 1.476 as feasible, 
the values z; and zs. 

. Calculate a* and c* by Eqns. (23) and (24), respectively. 

. Determine f. and t, by means of Eqns. (14) and (15), respectively, and 
use these values to calculate ¢;* and ¢,* from Eqns. (16) and (17), re- 
spectively. 

. Using the values obtained in Step 5, calculate log (y:* —a*), log (y,*—a*), 
yi* and y,* from Eqns. (18) and (19), substituting for a and c in these 
equations the values a* and c* obtained in Step 4. 

. For i=2, 3, ---, p—1 calculate log (y,*—a) and y;* by means of Eqn. 
(13). 

. Solve Eqn. (22) for 4, using a* and the values of y;* and log (y;*—a*) 
obtained in Steps 6 and 7 as initial estimates. This is the most tedious 
step in the procedure, because in addition to the calculations prescribed 
by the equation, it will be necessary, for each successive estimate a,* of 
a, to make a new estimate c;* of c, and to calculate new y;*. Inverse inter- 
polation can be used to determine 4 once a sufficiently small interval 
(a,*, a,*) has been found such that 


d(a,*) > 0 and X(a,*) < 0. 


. The estimates log 6 and é can be found from Eqns. (20) and (21) using 
the value of 4 calculated in Step 8 and the values of log (y,*—4) based 
on it. 


8. GOODNESS-OF-FIT TESTS 


The Chi-square test is commonly used to test for goodness-of-fit of an ob- 
served distribution function to a theoretical or calculated distribution function. 
The Chi-square test may also be applied on heuristic grounds to the present 
situation. A convenient expression, in terms of the life table data, is: 

x = N > (d; — l,-fi)? 


itm = 
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where f; is the expected relative frequency for the ith interval obtained by 
transforming the endpoints of the life table intervals into standard normal 
variates by means of Eqn. (10) and using a cumulative normal distribution 
table to obtain 


fi = Prob (t; < t < tis) (27) 


The number of degrees of freedom, n, is equal to p—4. 

The value of N, what is commonly called the number of observations, is not 
uniquely determined in the present situation. Limits may be set on the value 
of X, by assigning an upper and lower bound to N, viz. 


N’=T1, (28) 


and 


Pp 
N" =L,— > Wi, (29) 


respectively. The two values of X,? thus obtained will yield lower and upper 
bounds on the value of P, the probability of obtaining a fit as bad or worse 
than that observed. 

In the case of the study on chronic leukemia, it was decided to use the lower 
probability in deciding if a fit was too poor to be acceptable, while an inter- 
mediate value obtained by assigning to N the value 


oe . 
N” = Ly _-_ > W; (30) 


t=1 
was reported if the fit was clearly acceptable. 


AN EXAMPLE 


The following example was drawn from a recent investigation on the survival 
of patients with chronic leukemia [4]. The largest group, males with chronic 
lymphocytic leukemia, consisted of 234 individuals of whom 26 (11.1%) were 
lost to observation. The duration of survival of each patient was measured in 
months from the date of first diagnosis. The reasons for this choice have been 
adequately discussed elsewhere [8, 9]. The life table was calculated and the 
adjusted survival curve was plotted as shown in Table 1 and in Fig. 1. 

Two estimates of “a” were made by the short-cut procedure. In the first, the 
twentieth, fiftieth, and eightieth percentiles were estimated from the graph. 
These yielded values of 2.2, 11.2, and 34.5 months, respectively, and the resulting 
value of a;* using Eqn. (23), was —3.5. In the second estimate, the fourteenth, 
fiftieth, and eighty-sixth percentiles were estimated from the graph. The cor- 
responding values were 1.0, 11.2, and 41.8 months, respectively, giving a value 
of a,* of —4.1. This discrepancy is due to slight variations in the curvature of 
the observed distribution function. The values of b* and c* corresponding to 
the above estimates of a* are shown in the accompanying table. 

By means of Eqns. (14) and (15), and the values /;,=1,000, d;= 142, and 








LOG-NORMAL SURVIVAL DATA 





543 


TABLE 1. COMPARISON OF SEVERAL METHODS OF FITTING A TRANSLATED 
LOG-NORMAL DISTRIBUTION TO THE OBSERVED SURVIVAL OF 
PATIENTS WITH CHRONIC LYMPHOCYTIC LEUKEMIA 








Months 
after 
diagnosis 


Num- 

ber of 

deaths 
observed 


Num- 

ber of 

with- 
drawals 


Expected number of deaths of an initial 1000 patients 
as calculated by 





Lea’s 
Method 


Short-cut Method 





20, 50, 80 
percentiles 


14, 50, 86 
percentiles 


Modified 
maximum 
likelihood 

method 





-~ 
~— 


on rk WN = 


~I 


15 
18 
21 
24 
27 
30 
33 
36 
42 
48 
54 
60 
72 


30- 
33- 
36- 
42- 
48-— 
54- 
60- 
72-— 84 
84- 96 
96-120 
120-132 


mocooocor FP OCCCrR OFM OCOCWOrF OC OF KE KF Or KF OOF Nw 


89 
92 
74 
65 
55 
42 
43 
28 


138 
43 














26 








1000 





—4.0 
15.2 
0.4433 





df. 


X*min(N = 208) = 
Pp’ = 
X*max(N = 234) = 
Pp’ = 














22 

16.68 
.78 

18.77 
-66 
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d,=10, obtained from the life table, the values of t, and t, were calculated. 
These were found to be —1.07 and 2.33, respectively. Solving Eqns. (16) and 
(17) gave values of t;* and t,* of —1.58 and 2.64, respectively. 

Steps 6, 7, and 8 of the Recapitulation were then carried out using the values 
of t,*, t,*, a,*, and a,* calculated above. After just one iteration the interval 
(a,*, a,*) was sufficiently narrow to allow for inverse interpolation, and the 
value 4 equal to —4.0 was obtained. The estimates 6 and é were then calculated 
and are shown in the Table. The resulting distribution function is shown in 
Fig. 2. 

For comparative purposes, the values obtained by Lea’s method [7], where 
ais assumed a priori to be zero, are also shown. Goodness-of-fit tests were done 
for each set of parameters. The range on the value of x? and P is shown for each 
test in the last four rows of Table 1. Since there were 26 intervals after pooling, 
the degrees of freedom using Lea’s method are 23, while for the other methods 
they are 22. 

It is seen that the best fit is obtained by the modified maximum likelihood 
method. It should be observed that the second set of estimates by the short-cut 
procedure yielded solutions that differed but slightly from the modified maxi- 
mum likelihood solutions. This suggests that if the adjusted distribution func- 
tion is sufficiently smooth and if the points 21, z2, and z; are properly chosen, 
then a great deal of computational labor may be avoided with only a slight loss 
in accuracy by using the short-cut procedure alone. 
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ON THE CHOICE OF PLOTTING POSITIONS 
ON PROBABILITY PAPER 


Braprorp F. KimBaLu 
Slingerlands, New York 


This article is an attempt to throw more light on the problem of 
choosing plotting positions for the frequencies related to ordered 
sample values. The problem is particularly pertinent where special- 
scale graph paper is used such that a straight line indicates conformance 
of data to type of distribution assumed to apply. Such graph paper is 
often briefly referred to as “probability paper.” The types of distribu- 
tions here considered are the normal and the extreme-value distribution 
of Type I, with the accent on the extreme-value case. 

Various mevhods of determining plotting positions have been put 
forward and there is confusion of thought in judging what plotting 
convention is optimum. An optimum criterion proposed by Chernoff 
and Lieberman has received wide attention. Blom in a recent book 
develops further criteria [2]. 

To clarify the problem the author has analyzed the purposes to be 
served in plotting data on probability paper. With a specific concept 
of the purpose to be served in mind, it becomes clearer to the investi- 
gator which plotting convention would be the best one to use. 


1. GENERAL STATEMENT OF THE PROBLEM 


ET 21, %2, * * * , {, denote the sample values of a variate x which has location 
parameter yu and scale parameter 8. Define the “reduced variable” y by 


y = (x — u)/B. (1) 


The distribution functions of the reduced variable often do not involve unknown 
parameters. Two outstanding examples are the normal distribution 


F(y) = (1/V2=) f “e# itt (2) 


and the extreme-value distribution of Type I 
F(y) = exp [—e~]. (3) 


In the case of the normal distribution yu is the mean of z and £ is its standard 
deviation, and in the case of the extreme-value distribution uv is the mode of x 
and @ is J6/x times its standard deviation. 

Special coordinate plotting papers have been devised such that the x values 
are spaced arithmetically on the vertical scale and the y values are spaced 
similarly on the horizontal scale. There is further superimposed on the hori- 
zontal scale, a scale of cumulative frequency values obtained from relationships 
like (2) or (3) above, with values of F(y) directly above or coincident with the 
corresponding y values (Chart 1).* 

A property of such plotting paper is that if observed values of x conform to 





* Admitted that the variable z in ordinary graphs is taken as the horizontal coordinate, most probability paper 
is designed to show the functional scale f(y) measured horizontally. Thus the accompanying charts are so depicted. 
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the assumed distribution function F(y) the relation between z and F(y) will be 
linear. The following discussion will be concerned with the x values arranged in 
order of magnitude. The symbolism 


USS °-' SU (4) 


will be adopted for the z values arranged in ascending order. The corresponding 
values of the reduced variate y are denoted by 


w; = (ui — »)/B, Wi SWv2S--++ S wp. (5) 


Thus to a given value u; there corresponds a value of w; and if the parameters uv 
and 8 are known, from tables of the probability distribution ((2) or (3) above) 
the scale readings F(w,) can be precisely found. However, since the parameters 
u and 6 which relate w,; to u; are usually not known, there arises the question of 
how to choose F'(w,) in plotting a point to represent a u; value. 

The various plotting conventions described in the following paragraphs are 
based wholly on the size of the sample and the nature of F(y). The numerical 
values of the observations will not play a part in these descriptions. The con- 
ventions will indicate appropriate values p; on the scale of the cumulative dis- 
tribution or values v; on the scale of the reduced variate corresponding to the 
p;. As in formulas (2) and (3) the p; may then be expressed in terms of the »; by 
the relationship 


pi = FQ) (6) 
or conversely 
y= F-‘(p)). 


Thus if p; happens to be equal to F(w,), v; will equal the reduced variate w; 
(see (5) above). In general this will not be so since » and 8 are unknown. 

Most of the methods which have been in common use for selecting an ap- 
propriate scale reading p; are dependent upon the probability distribution of 
the ith ordered statistic. If f(y) denotes the probability density function asso- 
ciated with F‘y), then for a given order number i this probability distribution 
can be written as [2, p. 40 or 6]. 


n 

pap of we =(")ilPewa]ft — Fed] Fen), with ws = (ws — 19/8. 
1 

From this relation it can be shown that 


E[F(w,)] = i/(n + 1). 


For the normal and extreme-value distributions values of E(w,) = w; have been 
tabulated over a limited range of n; for the normal distribution [13] for 
n=1(1)20, and for the extreme-value distribution [11] for the 25 largest values 
of i, with n=1(1)10(5)60(10)100. Values of p; corresponding to the median 
and mode of w; have been recommended by some [1, 5, 6]. Median values have 
been tabulated for n=1(1)10 [15]. Modal values of w; have not been tabulated. 
One early method [7] recommended is to take p;=(i—})/n. This amounts to 
choosing the middle value of the interval from (i—1)/n to i/n. Thus at the 
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extremes pi=}3/n and p,=(n—})/n designate the scale readings to be asso- 
ciated with u and u,. Recently Blom [2] has recommended for the normal dis- 
tribution, p;= (i— })/(n+}). We shall refer to the above methods for selecting 
p; as “direct” methods since the rationale involved is based directly upon the 
order number 7.* They are summarized in the tabulation which follows. 


DIRECT METHODS OF SELECTION OF SCALE READINGS p; 








Method Number Reduced Variate v; Scale Reading p; 





I v; = F-[i/(n+1)] p, =i/(n+1) =E[F(w,) } 
IIA v, = F[(§—9)/(n+4)] pi = (i —9)/(n +4) 
II vn, =F[(¢-4)/n] pi=(t—4)/n 
III v;, = E(w) =; pi = F (wi) 
IV v; = median of w; =f; pi =F (Si) 
Vv v; = mode of w; =; pi = F(e) 











2. THE PURPOSE SERVED IN USING PROBABILITY PAPER 


Before proceeding further it is to be noted that the simplification afforded 
by the use of probability-scale graph paper is a visual simplification. The prob- 
ability paper transforms a curvilinear distribution into a straight line. If the 
approach is to be purely analytical there is no point in using the special scale 


paper. 
It then becomes important to have in mind the purpose served by plotting 
the observed data on the special scale graph paper. In general there are three 


rather different kinds of purposes that might be served. These are: 

(1) A test as to whether or not the sample data indicate that the universe is 
of the prescribed type. One argues that the universe is of the prescribed type 
only if the plotted points tend to lie along a straight line. 

(2) The graphical method may be used as a short-cut in estimating the stand- 
ard deviation of the distribution, which in turn is directly determined from the 
slope of the fitted straight line. 

(3) Graphical extrapolation at one of the extremes. This is the purpose most 
commonly served in plotting data :».m an extreme-value universe. Data are 
often plotted on extreme-value graph paper when it is known that on the lower 
range of maximum values the extreme-value distribution of Type I does not 
apply [9, p. 767]. If in the upper range there is likely to be good conformity 
with the Type I distribution a straight line fitted on the upper range is used as 
a basis for extrapolation of large extremes beyond the range of plotted points. 

These objectives can overlap. When accenting (2) one may well have a 
weather eye on Objective (1). For example, in examining a batch of samples 
taken from different populations one may have inferred that the universes are 
normal and so accent Objective (2). However, some errant populations may 
deviate considerably from the normal and so one may also give some weight to 
the graphical test of normality. Similarly in accenting Objective (3), Objective 
(2) is important. Furthermore, as noted above, there are situations where the 

* In Method IIA recommended by Blom for the normal distribution a criterion not directly related to the 


order number is included [2, p. 144 (c)]. Since the convention closely follows Method II we have included it under 
the category of “direct” methods. 
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data over the lower range are known not to follow the prescribed distribution; 
in which case data on the upper range are given greater weight in fitting a 
straight line. Thus Objective (1) is involved in indicating what part of the 
range conforms to the prescribed universe. 


3. CRITERION SUGGESTED BY CHERNOFF AND LIEBERMAN 


Chernoff and Lieberman [3, 4] have suggested a criterion for selecting in 
the normal case an “optimum” method of plotting.’ In the earlier paper the 
authors state: 

“The purpose of this paper is to indicate, with a special example, that any graphical 
technique should depend to a large extent on the purpose for which the graph is 
drawn. In particular, it presents tables covering sample sizes up to 10, for selecting 
the (scale readings p;) on normal probability paper so as to obtain ‘optimum’ graphi- 
cal estimates of the mean and the standard deviation o of a normal population. The 
somewhat more complicated problem of selecting the (scale readings p;) to obtain 
an ‘optimum’ test for normality is not discussed.”* 


Clearly CL do not pretend to offer a criterion for Objective (1), and further 
warn that the purpose served by the graph should be a factor in making a se- 
lection of methods. 

In order to translate the process of visually fitting a straight line to a set of 
plotted points into an analytical process CL propose the assumption (herein- 
after referred to as Assumption A): Assumption A “Let us assume that the 
visually fitted line is a very good approximation to the line that would be ob- 
tained by minimizing the sum of squares of the deviations (in the zx direction) 
from the line.” 

This assumption quoted from the 1954 paper is also the basis of the analysis 
further pursued in the 1956 paper. The analytical process of fitting a line to a 
prescribed set of plotting positions which minimizes the sum of squares of the 
deviations in the x direction yields sampling variances of the resulting analyt- 
ical estimates of the parameters » and 8. Thus CL argue that the optimal char- 
acter of a set of plotting positions might be judged in terms of the resulting 
magnitudes of the sampling variances of the parameters. This is taken as their 
criterion for judging the optimal character of a plotting method. 

Since the parameter 8, determined from the slope of the line, is the more 
difficult one to estimate, the accent is placed on the variance of this estimate. 
Thus CL in the 1954 paper determine values of p; for sample sizes 2 to 10 
which minimize the variance of an unbiased estimate of 8=c. Also it is shown 
that a biased estimate of 8 with minimum mean square deviation can be found, 
and the plotting positions for this biased estimate of 8 are given. 

In the 1956 paper CL also prove that for the normal distribution selection of 
p; based on minimizing the variance of 8 also furnishes the optimum plotting 
positions for purposes of extrapolation [4, Theorems 1 and 2, and Example 2]. 

The validity of the CL criterion and of the corollary just stated depends upon 
the validity of Assumption A. 

The CL criterion has received considerable attention because it provides 
analytical tests for judging the optimal character of a plotting method. 





1 Future reference to these writers will be by the letters CL. 
2 Parenthesis supplied by present writer. 
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Blom in his 1958 book [2, p. 144, conditions (a) to (c)] includes the CL cri- 
terion, but also adds another condition; namely, that 


Condition B “The points (u;p;) lie on the average on a line which deviates only 
very little from a straight line.” Analytically, Blom points out, this is to say 
that 


E(u;) = approx. uw + BF-'(p,;) = uw + Br. (8) 


Note that this relation becomes an equality under Method III since in this case 
v's taken as E(w,) =a. 

We shall show that Assumption A upon which the CL criterion is based is 
much more plausible if the distribution of the universe is symmetrical than if 
otherwise. Since the extreme-value distribution is finding a wide application 
in many scientific fields, we shall examine the application of the CL criterion to 
this type of distribution both from the standpoint of the numerical results 
which flow from accepting the CL assumption, and from the point of view of 
their relation to the graphical purpose which is to be served. 


4. EXTENSION OF THE CL ANALYSIS TO THE EXTREME-VALUE DISTRIBUTION 


As noted under (3) above, for the extreme-value distribution » is the mode 
of the parent distribution. Since the standard deviation of the reduced variate y 
is known to be #/+/6 it follows that the scale parameter 8 is related to the 
standard deviation of « by 


B = o2V/6/n. 


Thus the role of 8 in determining the best fitting line to a set of plotted points 
(v;, u;) on extreme-value paper is the same as that of o in the problem treated 
by CL in the 1954 article. Applying least squares’ theory (minimizing sum 
of squares of deviations in the z direction) the equation of the fitted straight 
line u=p+{v satisfies the normal equations 


fi+p=a 
v; — DO 


= 0; —- 5)? zc; = 0. (9) 


B = Leni, Ci 


We have here an estimate of 8 based on a linear function of the observed 
values «;. Optimum values of the coefficients c; have been determined by Lieb- 
lein to give an unbiased estimate of 8 with minimum variance for samples up to 
size six [12]. Another choice of these coefficients yielding an unbiased estimate 
of 8 for samples of sizes extending to n=112, based on a modification of the 
maximum likelihood equations has been suggested by the author in a previous 
paper [9]. 

It will be of interest to translate these methods by means of the equations 
(9) into conventions for choosing sets of plotting points (v;, u;) and note the 
results on extreme-value plotting paper. This we shall do for a sample of size 
six since this is the largest sample for which the optimum coefficients have been 
determined. The results will be compared with the direct conventions for choice 
of plotting positions v; indicated in (7) above. 
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Before discussing the determination of v; from c;, we present equations for 
evaluating the bias in the resulting estimates of » and 8 based on (9). Keeping 
in mind the fact that v;, # and c; are predetermined by the particular plotting 
convention used, we have 


E(p) = E(a) - (8) 
E(8) = 2eE(ui). 
With w; denoting the true value of the reduced variate associated with u; 
us = p+ Bu, 
ui =pt pw 
and 
E(u) = 4+ BE) = 4+ BE(y) = u + By, 
where 
y = Euler’s constant = 0.577216 - - - 
For a specific order index i= m, holding m constant, 
E(Um) = » + BE(Wm) = w+ Bom. 
Substituting in the first two equations, the biases in » and 6 are given by 


E(a) — » = Bly — (E(8)/8)s) 


E(8) — B= B[ Tew; = 1]. (10) 


These biases can be numerically computed for a limited range of n, except for 
the factor @ since 7 is known and a; has been tabulated [11]. 

The determination of the plotting positions v; from c;, which are consistent 
with (9) now proceeds as follows. From the second of the equations (9) one 
finds that 

L(v; —_ d)? => 1/Zc;?. 

4 

Hence (v;—#@) is determined by the second equation. Choice of é is independent 
of such determination. However, the bias of u as indicated by (10) depends 
upon the choice of 6. Each of the two sets of c; mentioned above give unbiased 
estimates of 8. Hence to give an unbiased estimate of u we take i= y. With 6 
thus chosen, the plotting positions v; are completely determined from the sec- 
ond equation of (9) to give unbiased estimates of 8 and uz. 

Conversely with values of v; taken from the direct conventions indicated by 
(7) values of c; are determined from (9) with # determined as the mean value 
of the v; selected. The resulting biases in @ and # are then found from (10). 

In order to compute the variance of an estimate of 8 note that since Zc;=0, 


B = 2e,(Bw; + w) = BIcw; (11) 


where w; denotes the true value of the reduced variate associated with 1. 
Since Lieblein has determined the variance-covariance matrix of w; for a sam- 
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ple of size six [12, Table B-1] the sampling variances of the various estimates 
of 8 by the linear relation (9) can be computed. 
In the tables included in Schedule A are shown the pertinent data associated 


SCHEDULE A. DATA FOR APPRAISAL OF OPTIMUM CHARACTER OF 
SEVEN PLOTTING CONVENTIONS BASED ON CRITERION PRO- 
POSED BY CHERNOFF AND LIEBERMAN APPLIED TO 
EXTREME-VALUE SCALE PLOTTING PAPER FOR 
SAMPLE OF SIZE SIX 


(Column headings indicate basis of plotting convention) 








ar we Ee vI VII 
¥. fi r= : paar ; Optimum | M.M.L. 
nty | | G-Dim if <, (Lieblein) | (Kimball) 





TABLE 1, PLOTTING VALUES OF VARIABLE 2; 





-6656 |— .8340 |— .9102 |— .7773 |— .7955 -8284 |—1.0793 | —1.1620 
-2254 |— .2979 |— .3266 |— .2545 |— .2850 -3440 0.4474 0.0373 
- 1658 -1421 - 1330 . 1884 . 1460 -0651 0.8412 0.6374 
- 5804 -6075 -6180 -6627 -6030 -4902 1.0342 1.03638 
-0893 1.2003 1.2459 1.2750 1.1805 -0059 1.1164 1.3366 
-8703 2.2504 2.4417 2.3690 2.1583 .7919 1.1033 1.5772 


Mean of »; -4691 .5114 - 5336 .5772 .5012 -3635 .5772 .5772 





TABLE 2. PLOTTING VALUES OF SCALE READINGS pj ON EXTREME-VALUE PAPER 
. 1429 - 1000 -0833 -1135 -1091 -1013 -0527 -0409 
- 2857 - 2600 - 2500 -2753 - 2644 .2440 .5277 
-4286 -4200 -4167 -4368 -4214 -3918 -6497 - 5894 
-5714 -5800 - 5833 .5972 - 5786 . 5420 -7008 
-7143 -7400 -7500 - 7562 -7356 -6937 -7208 
-8571 -9000 -9167 -9107 -8909 -8465 -7176 



































TABLE 3. COEFFICIENTS c¢; RELEVANT LEAST SQUARES’ FIT 





. 2688 -2202 - 2022 -2122 -2298 -2661 | — .4593 
-1645 1325 -1205 - 1303 -1393 -1579 |— .0360 
-0718 -0604 -0561 -0609 -0629 -0666 -0732 
-0264 -0157 -0118 -0134 -0180 -0283 - 1267 
- 1469 -1128 -0998 -1093 -1204 -1434 -1495 
.3319 - 2846 - 2672 - 2807 - 2936 -3189 - 1458 











TABLE 4. BIAS AND VARIANCE OF THE ESTIMATES WHICH RESULT 
FROM LEAST SQUARES’ FIT* 





Bias of 2 .0010 .0546 0728 | 0 .0433 .1441 0 
Bias of 8 .2284 .0220 |— .0547 0 .0632 .1916 0 
Variance of 8 .2489 .1754 .1513 . 1686 .1888 .2325 
(Bias)? .0522 .0004 .0030 | 0 .0040 .0367 0 
Mean Squ. Dev. .3011 .1758 . 1543 . 1686 . 1928 . 2692 . 1320 





TABLE 5. VARIANCE OF ESTIMATES CORRECTED FOR BIAS* 





Cor. Factor -8141 .9785 1.0579 — . 9406 . 8392 — 
Squ. of Cor. 

Factor .6628 .9574 1.1192 - 8846 . 7043 
Var. of Cor. Est. . 1650 .1679 ; - 1693 .1671 .1637 





























® Biases of P and B are to be multiplied by factor 8, not shown (see equation (10)). Variance and mean square 
deviation of 3 are to be multiplied by factor 6?, not shown (see equation (11)). 
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with eight plotting conventions. As indicated previously the order number 7 
proceeds from the smallest x; to the largest. The headings tell the basis of each 
plotting convention and are self-explanatory except that M.M.L. stands for 
the “modified maximum likelihood” method of determining the coefficients in 
Table 3. As mentioned in the footnote the figures for biases and variances in- 
volve the factors 8 and 8?, respectively. This follows from (10) and (11). 

Where the bias of 8 is not removed by & correction factor the mean square 
deviation from the true value is shown along with the variance (Table 4). In 
Table 5 are shown the variances of estimates corrected for bias. This table is 
important if the graphical method were to be supplemented by introducing a 
correction for bias. Note that the correction factor is obtained by dividing 8 
by E(8) and that this ratio is independent of the value of 8, and can be computed 
for any value of n for which w; has been tabulated [11, 13]. Furthermore it 
brings out the fact that the “direct” methods when corrected for bias, all have 
about the same variance. The same phenomenon applies in the normal case 
(Table 10 of Schedule B). Furthermore a positive bias produces a large mean 
square deviation, while a negative bias produces a small one (cf. Methods I 
and II). This brings up the question as to whether or not the “bias” should be 
equalized before measuring the optimal character of a plotting convention by 
its mean square deviation. 


Note. As a digression from the problem posed in this paper one may note in 
connection with Methods VI and VII—the optimum and M.M.L. choice of c; 
—that the variance of the estimate of 8 by Method VII is only slightly greater 
than that of Method VI. This may perhaps indicate that the same is true for 
larger samples. This is of particular interest in connection with the problem of 
obtaining the best estimate of 8 based on a linear function of the observed 
sample values u; because the M.M.L. method involves coefficients which are 
easily computable for sizes of samples up to n= 112, while the optimum choice 
of the c; is only available for sample sizes up to n=6 [9]. 

For purposes of comparison we present in Schedule B the data associated 
with the CL criterion as it is applied to a normal universe, for the same sized 
sample. Method VI refers to the linear unbiased estimate of minimum variance 
referred to as Method 2 in Table II of the CL 1954 paper, and Method VII 
refers to the linear biased estimate of minimum mean square deviation, re- 
ferred to as Method 4 in that table. The biases of 8 have been computed di- 
rectly from the formula (10) above. It will be noted that because of the sym- 
metry of the normal distribution there is no bias in the estimates of », and it is 
necessary to show only plotting positions on one side of the mean. The others 
are obtained from considerations of symmetry. In Tables 6 and 8 this means a 
reversal of sign, and in Table 7 the complimentary figure is required. 

As in Schedule A it is of interest to note from Table 10 that when estimates 
of 6 are corrected for bias there is little to choose between the direct methods, 
judging by the magnitudes of the variances of the estimates; and in the normal 
case the indirect methods also have very nearly the same variance. 


5. DISCUSSION OF AN EXAMPLE 


We now consider an example of what happens when the plotting positions 
p; indicated by Table 2 of Schedule A are used for plotting data on extreme- 
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SCHEDULE B. DATA ASSOCIATED WITH CL CRITERION APPLIED 
TO NORMAL PROBABILITY PAPER FOR SAMPLES OF SIZE SIX 


(Column headings indicate basis of plotting convention) 








IIA VI 
I i-} i IV Unbiased 
Order No. i p= os ri= “=f Linear 

F = res it =Se 

i/(n+1) n+k (i—)/n (CL) 





TABLE 6. PLOTTING VALUES OF VARIABLE »; 





- 1800 -2019 -2104 -2016 .1983 -1919 -1773 - 1960 
- 5660 -6433 -6745 -6418 -6297 -6059 - 5684 -6285 
1.0676 1.2816 1.3830 1.2672 1.2313 1.1601 1.3025 1.4402 




















TABLE 7. PLOTTING VALUES OF SCALE READINGS pj ON NORMAL PROBABILITY PAPER 





.5714 . 5800 .5833 .5799 - 5786 .5761 - 5704 -5777 
-7143 -7400 -7500 -7395 -7356 -7277 -7151 -7352 
-8571 -9000 | -9167 -8975 -8909 -8770 -9036 -9251 





TABLE 8. COEFFICIENTS ce; RELEVANT TO LEAST SQUARES’ FIT 





-0603 -0481 -0436 -0490 -0508 -0548 -0432 
- 1896 - 1534 -1398 - 1559 -1613 -1731 -1386 
3577 - 3056 - 2867 -3078 +3154 -3315 +3175 


























TABLE 9, BIAS AND VARIANCE OF THE ESTIMATES OF ¢ WHICH RESULT 
FROM LEAST SQUARES FIT* 





-1741 -0093 |— .0763 -0269 
- 14599 - 10379 -09024 ° -1116 
-03031 -00009 -00582 -0007 
- 17630 - 10388 -09605 -1124 





TABLE 10, VARIANCE OF ESTIMATES CORRECT: 





Cor. Factor -8517 1.0094 1.0826 -9738 é . 1057 
Squ. of Factor -7254 1.0188 1.1721 -9484 -8504 -2226 
Var. of Cor. Es- 

timate .1059 . 1058 -1058 —- .1059 -1058 — . 1057 





























® Bias of ¢ is to be multiplied by factor ¢, not shown (see equation (16)). Variance and mean square deviation 
of ¢ are to be multiplied by factor o*, not shown (see equation (11)). 

> Values of mean square deviation for Methods I, II, VI and VII have been taken from Table II of CL {3}. 
The values for Methods IIA, III, IV and V were computed by the author. 


value scale plotting paper. For this purpose we use the sample most likely to be 
drawn; namely, the set (a, a, - - - , &) where i;=y+e;.* The set of values 
UW = 1.000, w= 1.555, us = 2.023, ug = 2.510, Us =3.100, ug = 4.000 with p = 1.9484 
and 6= 1.1449 satisfies this conditiox. 

On Chart 1 we show the positions of four plotting conventions other than 
those of the modal w,; (which lie along a straight line). The plotted points 
marked by the small circles represent positions determined by Method I and 
those marked by squares represent Method IIA. It will be noted that the me- 





* Some will argue that several randomly chosen samples would be preferable. The writer has set up such sam- 
ples. It would be still better to obtain 100 such samples and summarize. Since the samples are from a known type of 
universe such a summary can best be made by using known properties of the universe which spawns them. This we 
have done by choosing the modal sample for examination. We hasten to point out that to be consistent, whatever 
conclusions are drawn must also be obtained from the known properties of the parent distribution rather than from 
the particular sample exhibited. 
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dian positions at the two extremes lie between the points plotted by Methods 
I and ITA. However, the positions of Method VI, determined indirectly from 
the optimum linear estimate of 8 (marked by large circles) and Method VII, 
determined from the modified maximum likelihood estimate of 8 (marked by 
X’s), show a trend of a decidedly different character. The characteristic of 
this trend is that at both extremes the optimum indirect conventions show 
plotting positions at values of p less than the values associated with the central 
tendency (mode, mean of p;, median, mean of w;, etc; see Tables 1 and 2 for 
conventions not shown on the chart), and at the center of the set of values of 
the sample, the values of p; are greater than values represented by the central 
tendency. Hence it may be inferred that in general for the extreme-value case 
samples plotted by Methods VI and VII will be represented by plotting po- 
sitions which tend to lie along the above type of curve, rather than along a 
straight line. 

Note that the curvilinear tendency just observed results from a systematic 
violation of Condition B introduced by Blom (see (8) above). From Table 1 of 
Schedule A we find that the v; of Methods VI and VII are considerably less than 
the v; of Method III at each extreme, and in the middle portion are greater 
than the v; of Method III. Indeed, in the case of Method VI it is impossible to 
find a sample such that points plotted by this convention would lie along a 
straight line, since by this convention p, is less than p,-1. 

Some of the reasons why the indirect conventions produce the distortion 
noted are quite apparent: (1) the over-all minimizing condition of least squares 
gives no recognition to the individual signs of the expected deviations other 
than that collectively the signs should be such as to produce an unbiased slope 
and mean (or modal) value; (2) since the variances of u; are progressively larger 
as one proceeds towards the largest extreme, the numerical values of the coeffi- 
cients c; which produce minimum variance (cf. equation (11)) tend to be smaller 
at the upper extreme and larger at the lower extreme, than in the case of the 
direct conventions. This in turn steepens the trace of plotted points at the 
upper extreme and flattens it at the lower extreme. 

On the other hand with a normal universe and points plotted on normal 
probability-scale plotting paper, the results are quite different. This is illus- 
trated on Chart 2. Here again we have chosen a modal sample, with 7;= 4+ «. 
The set of values u%=1.000, w=1.717, u3=2.252, usg=2.748, u3=3.283, 
ug = 4.000 with 8 = o=1.2930 and »=2.500 satisfies this condition. The corre- 
sponding readings on the probability scale are shown in Table 7 of Schedule B, 
under Method V. The small circles on the chart represent plotting positions 
selected by Method I and the crosses show plotting positions selected by 
Method II. The large circles indicate “optimum, unbiased” plotting positions 
recommended by CL in the 1954 paper,—Method VI of Schedule B. In order to 
more clearly indicate the central tendency at the extremes, we show the plotting 
positions of the median p;=F({;) by arrows at i=1 and i=6,—Method IV of 
Schedule B. 

It is notable that in the normal case the indirect method produces a trace of 
plotting positions quite in harmony with plotting positions following the central 
tendency. Of course, for large samples the spread at the extremes will be greater. 
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But for any size sample, because of the symmetry of the normal universe, we 
shall not have the bending of the trace of plotting positions of Method VI in 
one direction relative to the trace of plotting positions following the central 
tendency (Methods III to V). The reversal in curvature of this trace alleviates 
the principal undesirable feature which may result from applying the indirect 
methods. 


6. CONCLUSIONS 


In the light of the facts presented in the above schedules and giving due 
thought to the purpose to be served in plotting data on probability paper, the 
investigator can select the plotting convention he will use with some degree 
of rational assurance. 


The Extreme-Value Universe 


Because of the asymmetry of this distribution there is a serious question as 
to the validity of Assumption A as it applies to Methods VI and VII. It is 
questionable whether the fitting of a straight line by eye to a trace of plotted 
points, which has a tendency to have considerable curvature all in one direction, 
would result in the same line (or nearly the same line) that would be determined 
by the analytical process of fitting by least squares (deviations taken in the xz 
direction). If one were to accept the proposition that there would be some 
tendency for the lines to be nearly the same, the question arises as to whether 
the probable error due to uncertainty of the eye-fit agreeing with the least 
squares fit would not be much larger than if the plotting positions had a tend- 
ency to lie along a straight line. Thus the investigator might well conclude that 
the indirect methods, VI and VII, are not suitable for any of the graphical 
objectives. 

On the other hand, in the case of the direct methods (which more nearly 
meet Condition B) perhaps one can accept Assumption A and give some weight 
to the variance measures shown in Tables 4 and 5 of Schedule A. The investi- 
gator might be guided as follows: 

If Objective 1 (test of parent distribution) is the main one, perhaps Method 
III would be the best. This will satisfy Condition B exactly. If values of w; 
and F(w,;) are not available, Method IIA which varies slightly from Method 
III in the direction of the mode might be used.! 

If Objective 2 (estimate of scale parameter) is the main one, Method III has 
the advantage of producing no bias. Method IIA might be substituted for 
purposes of simplicity. If a correction factor is available, Method I is simpler 
and after correction the variance of the estimate of 8 is low. 

Correction Factor = 8/E(8) =1/Ze,0,. 

1 Using some of the arguments used by Blom for proposing Method ILA as moet suitable for the normal case, 
one might arrive at the formula pj =(i —})/(n +4) as a good method in the extreme-value case [2, p. 74]. The author 
experimented with thie and found that although it reduced the negative bias of 8 of Method II from —.0547 to 
—.0266, with MSD of .1630, that when corrected for bias the variance came to .1713. Method ITA incurs a slight 
positive bias of 8 of .0220, but variance of corrected estimate is at .1679. Furthermore the plotting positions are 
slightly to the right of w, while in Method IIA they are to the left of #;. Thus for IIA they are on the same side as 


the modal positions ¢; and hence on the side where probability density is greater. Accordingly the author hag pre- 
ferred to introduce Method IIA as the compromise between Methods I and II in the extreme-value case, 








PLOTTING POSITIONS ON PROBABILITY PAPER 559 


If Objective 3 (extrapolation to the right) is the main one, there will be room 
for considerable exercise of judgment depending upon the weight given to the 
extremes u,_; and u, in fitting the line. Although Method III, under the pre- 
sumption that Assumption A is valid, gives no bias some consideration might be 
given to the fact that the distribution of u, is highly skewed. It might be well to 
supplement Method III (or the simpler Method IIA) by plotting the points 
determined from Method I. These latter are slightly to the right of the modal 
points (see Table 1) and so when taken along with the positions determined 
from Methods III or ITA furnish some idea of the range of uncertainty in extra- 
polating from the graph.? 

In other words, in dealing with Objective 3, although measures found from 
the CL analysis, as shown in Schedule A, are helpful, judgment will have to 
be exercised that cannot be guided precisely by formula. This is apt to be par- 
ticularly true for the extreme-value distribution, because at the lower extreme 
the sampled universe in many cases is known not to be precisely that described 
by formula (3). 


The Normal Universe 


In this case we do not seriously question the validity of Assumption A. Our 
greatest concern is with the matter of the “bias” in the estimate of B=c. We 
are inclined to be suspicious of basing a criterion for optimal choice upon the 
mean square deviation, if a significant bias is present. Blom has introduced a 
formula, Method IIA, which pretty well meets such objection, and at the same 


time carries a low MSD of the estimate of 8. It also is believed to satisfy Con- 
dition B for large values of n as well as for the values up to n=15 which have 
been tested empirically [2, p. 71, Table 2]. To meet either Objectives 1 or 2 we 
recommend Method ITA. 

For Objective 3, if one accepts Assumption A as entirely valid, we note that 
CL have shown in their 1956 paper [4] that the optimum criterion of minimum 
sampling variance (as computed in Table 9 above) or mean square deviation, 
if there is a bias, points to optimum extrapolation as well, in the same sense.* 
Accordingly, if one accepts Assumption A one can agree that Method ITA 
would be satisfactory for extrapolation. 

If, however, the graphical fitting of the line is presumed to be more influenced 
by the plotting positions of the extremes (say at the right, with extrapolation 
to the right) than would be true under Assumption A, then again there is need 
for caution in relying upon any single formula. 
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2 An interpolation chart which shows the position of the interval of maximum probability density for proba- 
bility level .50 applicable to u, for various values of n will prove helpful to the investigator faced with graphical 
extrapolation where considerable weight is given to the plot of the largest sample value. The author has set up 
such a chart both for the normal and the extreme-value universe which may be made available through publication 
at some time in the future. 

* The author has proved, in an unpublished co ication to Chernoff, 9-7-57, that the condition which leads 
to this corollary is not satisfied in the case of the extreme-value distribution. 
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BIBLIOGRAPHY ON SEQUENTIAL ANALYSIS* 


J. EDWARD JACKSON 
Virginia Polytechnic Institutet 


This article contains 374 references dealing with the subject of se- 
quential analysis. These references are listed alphabetically by author 
and also broken down into fourteen subject categories. 


HE concepts of sequential analysis go back many years but, in the modern 
p go of Wald, it is a relatively new subject barely having reached its 
fifteenth birthday. Nevertheless, a considerable amount of material has been 
written on various aspects of the subject. A great many of these articles have 
appeared in journals not published in this country and it would seem that a 
bibliography on this subject would be quite useful to those now working in the 
field. The format of this bibliography will be quite similar to that compiled by 
I. R. Savage (Journal of the American Statistical Association, Vol. 48, 1953, 
pp. 844-906) in the field of non-parametric statistics and will include, in so far 
as possible, all such articles which have appeared through 1959. 

As in many fields of statistics, there is the problem of definition. Theoretically 
any procedure wherein the decision to gather more information depends on 
the information already gathered can be considered a sequential procedure. 
This bibliography, however, will be restricted primarily, though not exclusively, 
to the Wald concept of sequential analysis and in so doing will exclude such 
special cases as double sampling, multiple sampling and response surface tech- 
niques except for some of the basic pioneering works on these subjects. These 
are, admittedly, borderline cases. There are so many articles on these subjects, 
particularly in various industrial journals, that it would be impossible to cover 
them all; however, the basic references are given as well as any which are more 
directly tied into standard sequential techniques. A true sequential procedure 
can be considered as both a decision procedure and a stochastic process and as 
a result, there is considerable overlap in the content of the literature in these 
fields. Inasmuch as possible, this bibliography is restricted to those articles 
dealing with decision functions and stochastic processes which make direct 
reference to sequential analysis as such. 

As in the case of Savage’s article, entries in this bibliography are arranged 
alphabetically by author (first author only in the case of joint authorship) 
and chronologically within authors. The letters after each entry refer to the fol- 
lowing list of categories: 


A—General and expository articles. 
B—Theory of sequential analysis. 
C—Sequential procedures for the binomial distribution. 
D—Sequential procedures for other discrete distributions. 
* Sponsored by the Office of Naval Research, Department of the Navy: Contract Number; NONR-2352(01). 
Task Order 042-019 with the Virginia Polytechnic Institute. Reproduction in whole or in part is permitted for any 


purpose of the United States Government. 
+ Now with the Eastman Kodak Company, Rochester, New York. 
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E—Sequential procedures for measures of central tendency—normal dis- 
tribution. 

F—Sequential procedures for measures of dispersion—normal distribution. 

G—Sequential experimentation and sequential analysis of variance. 

H— Sequential multivariate techniques including correlation, regression, and 
stochastic approximation. 

I—Stochastic processes. This category includes sequential procedures for 
Poisson processes, queuing problems and other discrete and continuous 
stochastic processes per se. Papers in which stochastic process techniques 
are used to develop sequential analysis theory are included in category B. 

J—Sequential non-parametric techniques. 

K—Sequential procedures dealing with proportions when the underlying 
distribution is normal. This includes probit analysis and the use of vari- 
ables for acceptance sampling for proportion defectives. 

L—Sequential procedures dealing with non-normal continuous distributions. 

Y—Miscellaneous. This category includes other sequential procedures not 
covered by the preceding categories. 

Z—Unclassified. This category is made up of publications which have not 
been seen by the author and for which no abstract has been found which 
gives a clear indication of its content. 


The numbers in brackets which sometimes follow these letters refer to the year 
that this particular publication was abstracted in Mathematical Reviews and 
the page on which the abstract may be found. Articles are assumed to be in the 


same language as the title unless otherwise indicated. Papers marked with an 
(*) also appear in Selected Papers in Statistics and Probability by Abraham Wald, 
McGraw-Hill Book Company, New York 1955. 
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A Time Series Analysis of Interindustry Demands. Kenneth J. Arrow and Marvin Hof- 
fenberg with the assistance of Harry Markowitz and Ronald Shephard. Amsterdam, 
Holland: North-Holland Publishing Co., 1959. Pp. vii, 289. $7.25. 


L. R. Kuern, University of Pennsylvania 


ier ent analysis is an accepted branch of econometrics based on a mathe- 
matical model and using actual statistical data of economy. Yet it has developed, 
unlike other branches of econometrics, without stochastic foundations. In a sense, 
past applications of this technique have been based on no degrees of freedom making 
it impossible to assess the reliability of estimates or make other statistical inferences 
about the universe. A major contribution of the Arrow-Hoffenberg study is to provide 
a probability model for input-output analysis. They make actual applications of 
their model to a sample of industrial data. 

The need for a probability model underlying the method is emphasized by its 
lack of accuracy in prediction. Tests of accuracy in the prediction of industry outputs 
from a given bill of final demand or, what is the same thing, tests of constancy of 
the input-output coefficients determined in the usual way from a single period’s 
observations have been made by proponents and critics of input-output. Arrow and 
Hoffenberg give a useful summary of these tests and show clearly that the coeffi- 
cients have not remained constant. Regression of individual output on total output 
and final demand blowups tend to give superior projections. 

The facts that input-output projections are decidedly not accurate and that they 
do not appear to deviate randomly from the observed values lead the authors to 
propose some new structural relationships in which residuals (between projected 
and observed outputs) are related to a selected number of important economic 
variables. They choose real per capita disposable income, the ratio of defense expen- 
diture on goods to private GNP, a time trend, and the ratio of positive deviations 
of output from past peak levels to current output. 

Their model takes the form! 


4 
a(t) = by + z cijeye(t) 
kel 
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a;;(4) = variable input-output coefficient of period t 
y.(t) = variable explaining fluctuations in a,;(t) 
z(t) = output of sector 7 in period ¢ 

fi(t) = final demand for sector 7 in period ¢ 

u;(t) = random disturbance. 


The 6;; and c;;, are unknown structural parameters. Essentially, the Arrow-Hoffen- 
berg method uses time series observations on 2,(t), y(t), and f,(t) to estimate by; 
and c; from the second of the two equations or a transformation of it. For individual 
years, there are not enough data available to make direct estimates of the first 
equations. It would seem appropriate to allow for stochastic variation in the first 
as well as in the second of the two above equations. This would have made some 
difference in the statistical treatment.” 





1 A similar treatment is given by F. E. A. Briggs, “On problems of estimation in Leontief models,” Econometrica, 
Vol. 25, July, 1957, pp. 444-55. 
2 Ibid. 
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The four particular variables mentioned above, y(t) (k=1,2,3,4) were used as 
factors of major practical importance. Relative prices were not included because it 
was felt that production is inelastic in the short run, but, surely, the input-output 
scheme is a long run model used at its best for the analysis of long run problems. 

The book contains an excellent discussion of problems of statistical inference for 
the type of model considered. There is a good description of methods of estimation 
in systems of simultaneous linear equations. For computational ease, they decide 
upon the use of Theil’s k-class of estimators with a particular value assigned to k. 
In view of present computing technology, there is little reason to shun limited infor- 
mation maximum likelihood because of cost considerations. The work breaks new 
ground with development of linear programming estimates, which take account of 
the non-negativity and other limitations on the size of input-output coefficients. 
In the end, better results are obtained for the linear programming than for the 
ordinary equation systems method of estimation. 

The authors use, as a point of departure, the 1947 interindustry study. Their 
time series values cover the years 1929-50. They consolidate the 1947 study into 66 
sectors. It is a “truncated” model in that many industry sectors are included in 
final demand. They carefully consider the major and negligible flows in their layout. 
Ultimately they estimate equations for variable input-output coefficients a;;(t), in 
only four sectors. They are Lumber and Wood Products, Crude Petroleum, Petroleum 
Products, and Rubber Products. In each of the four cases, they consider the a;;(t) 
to be variable for only a limited number of consuming sectors, at most eight and do, 
not use all four explanatory variables simultaneously. They had to make use of a 
number of other predetermined variables that would be relevant to a more complete 
model but that did not appear in the structural equations being estimated. With 
these, they were plagued with problems of multicollinearity and shortage of degrees 
of freedom. 

In general they come up with useful relationships explaining fluctuations in those 
a;;(t) that they chose to investigate. Their approach is highly interesting, ingenious, 
and properly executed. They frankly admit, however, that their findings lack positive 
power. They have not found a set of numerical values that are going to give genuine 
improvements of large magnitude over conventional input-output results using the 
1947 tables. They are perhaps too sweeping in their pessimism concerning the general 
futility of time series analysis. They say, “The complexity of the model makes it 
difficult to see clearly where it is wrong.” This reviewer would say that their brilliant 
new tools should have been applied more cautiously in a smaller, more aggregative, 
self-contained model with a gradual build-up of success upon success to the finer 
classifications that they chose. 


Elementary Decision Theory. Herman Chernoff and Lincoln E. Moses. New York: John 
Wiley and Sons, Inc. London: Chapman and Hall, Ltd., 1959. Pp. xv, 364. $7.50. 


Lionet Weiss, Cornell University 


yan this text represents a departure from the topics usually covered at this 
mathematical level, it seems desirable to start this review with a list of the main 
topics covered, chapter by chapter. 

Chapter 1: Decision making when there is uncertainty about the state of nature; 
strategies; average losses; admissibility and inadmissibility of a strategy; minimax 
average loss strategies; minimax average regret strategies. 

Chapter 2: Frequency distributions; histograms; cumulative frequency polygons; 
summation signs; sample mean and variance and their computation. 

Chapter 3: Probability as a long-run frequency; probability distributions and cu- 
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mulative distribution functions; random samples; the normal distribution; sets and 
functions and their use in probability theory. 

Chapter 4: Utility as a measure of value when the probability distribution is 
known; probability and expectation; application of utility to fair bets; the St. 
Petersburg Paradox; parameters of probability distributions. 

Chapter 5: Uncertainty due to the ignorance of the state of nature; randomized 
strategies; convex sets; Bayes strategies and supporting planes; minimax strategies; 
regret. 

Chapter 6: A posteriori probability; conditional probability; computation of Bayes 
strategies; independence. 

Chapter 7: Introduction to classical statistics; examples of hypothesis testing and 
of point and interval estimation; power; principles for setting the level of significance. 

Chapter 8: Mathematical models. 

Chapter 9: A summary of terminology and methods of testing hypotheses. 

Chapter 10: A summary of terminology and methods for point and interval esti- 
mation. 

Tables, and appendixes on notation, shortcut computational formulas, axioms of 
probability, properties of expectation, likelihood ratio tests, game theory, utility 
functions. 

In their preface, the authors recommend the first seven chapters for a first course 
in the fundamental ideas of statistics, requiring only high school mathematics. They 
feel that some background in statistics or mathematics would be desirable before 
attempting to cover chapters 9 and 10. 

The text is very clearly written, and there are many illustrative examples and 
exercises. The reviewer feels that the arrangement of the material should have been 
made somewhat tighter: for example, the sections on probability theory are scattered 
in various parts of the text. Many of the appendixes are on a higher mathematical 
level than the main part of the text, and it is doubtful whether a student who would 
be challenged by the first seven chapters would be able to handle these appendixes. 

Although there is little doubt that as time goes on more and more first courses in 
statistics will be given from the decision theory point of view, at present this is 
rarely done. One reason for this inertia has been the lack of suitable textbooks. The 
text by Chernoff and Moses requires more mathematical maturity (which is not 
identical with formal course work) than most of the current pre-calculus texts for 
first courses in statistics, but it should be considered by those instructors who would 
like to introduce a more modern (and a more logical) approach in a basic course in 
statistics. 

Operations Research: An Annotated Bibliography. James H. Batchelor. Saint Louis: Saint 
Louis University Press, 1959. Pp. ix, 866. 
O. H. Browntez, University of Minnesota 


I THIS volume are listed and (in many cases) abstracted more than 4,000 books 
and papers having some relevance to that nebulously defined area “operations 
research.” The papers listed include all of those from some American, British, Italian, 
French, German, and Japanese journals devoted to operations research or one or 
more of its fields as well as reports and published materials from conferences and 
meetings. There is an 84-page index classifying the listed materials (by number) 
according to author and subject matter. The bibliography extends a similar one, 
Operations Research: A Preliminary Annotated Bibliography, published in 1952, to the 
end of 1957. 

A useful appraisal of any classification of knowledge requires familiarity with the 
subject matter, the available materials, and the demands and competencies of the 
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persons for whom the classification is prepared. The subject matter of operations 
research is by no means clearly defined. In fact there is a large volume of literature 
addressed to the question, “what is operations research?” The field appears to be 
defined both with reference to kinds of problems and techniques for solution. In 
this reviewer’s judgment, the author certainly has not interpreted the field too 
narrowly. 

Because of the broad definition of the field, materials relating to such methods of 
formulating and solving problems as various forms of programming (linear, non- 
linear, parametric, dynamic, etc.), queueing theory, game theory, and simulation as 
well as to broad classes of knowledge such as statistics, mathematics, economic 
analysis, accounting, and psychology could be considered as operations research 
materials. To provide a comprehensive listing of books and papers from all of these 
areas that have applications to operations research is an accomplishment that can- 
not reasonably be expected of a single bibliographer. However, a user of the bibliog- 
raphy would hope that really important things are listed in the bibliography and 
that the work contains a minimum of materials of dubious value. 

In the reviewer’s opinion, an excessive number of works of dubious value (particu- 
larly newspaper reports and items from trade or business journals) have been listed. 
Perhaps the author wished to compile a bibliography for both businessmen and 
research workers or practitioners. Also, he states that he prefers to minimize errors 
of omission by accepting the author’s or editor’s indication that the work pertains 
to operations research. However, this procedure apparently is not always sufficient 
for among the items of importance that are omitted is Feller’s An Introduction to 
Probability Theory and Its Applications. 

The bibliography should prove useful. Its detailed index and the abstracts are 
particularly attractive features. It classifies some unimportant materials and omits 
some important ones; but can there be a bibliography on this subject which doesn’t 
make such mistakes? 


Variables Related to Human Breast Cancer. V. Elving Anderson, Harold O. Goodman 
and Sheldon C. Reed. Minneapolis: The University of Minnesota Press, 1958. Pp. xiii, 
172. $4.00. 


Harowp F. Dorn, National Institutes of Health 


HE development of cancer by two or more closely related members of the same 

family has attracted the attention of physicians and geneticists for at least two 
hundred years. This fact was first interpreted by many as demonstrating that cancer 
is a genetically determined disease. It was gradually recognized, however, that 
multiple cases in the same family might be merely coincidental rather than an 
expression of inherited susceptibility and that different types of cancer probably are 
separate diseases with different specific etiological agents. Furthermore, statistical 
studies of the incidence of cancer in various populations revealed large differences 
in morbidity and mortality from cancer associated with demographic, cultural and 
environmental characteristics, which suggested that both exogenous and endogenous 
factors influence the development of cancer. Except for a few rare forms of human 
cancer the disentanglement of the relative importance of extrinsic and intrinsic 
factors in the development of cancer has proven to be unexpectedly difficult. 

The authors of this study sought the answer to the question, “do the relatives of 
breast cancer patients have an increased risk of developing cancer, and if so, is this 
an increased risk for cancer in general or for specific forms of cancer?” The book 
begins with a concise review of the methods of study used by previous investigators 
and of the problems encountered in the interpretation of the results of these studies. 
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After reviewing the advantages and disadvantages of the three principal methods— 
the study of family pedigrees, the analysis of the history of twins, and the analysis 
of data about relatives of persons with cancer—the authors selected the latter 
method as the most suitable for their purpose. 

The remainder of the book is devoted to a thorough discussion of the four main 
problems that are encountered in studies of this kind, (a) how shall the index cases 
or propositae be selected, (b) what control or comparison group shall be used, (ec) 
how can the data be validated, (d) what are the most appropriate methods of analysis. 
Each problem is discussed in meticulous detail, procedures taken to solve the problems 
are clearly set forth, and the results are frankly evaluated. 

The data are from the histories of the families and relatives of 544 women who 
were examined at the Tumor Clinic of the University of Minnesota Hospitals. The 
control group was the parents and sibs of the husbands of these women. As a check 
upon the conclusions drawn from the analysis of the histories of these persons, a 
second set of propositae was chosen from the records of the University of Minnesota 
hospitals for the years 1910-1925 and from the University of Minnesota Pathology 
Department records for the years 1910-1920. 

The data for each of the separate groups studied are carefully described and thorough- 
ly analyzed for possible biases that might affect the interpretation of the data. The 
variation in the conclusions that would be drawn by using different control groups 
is illustrated. A variety of methods of adjusting and analyzing the data are tried 
and the results are contrasted. Throughout, special attention is paid to the validity 
of information used. This book should be a required study for all investigators 
planning to undertake a retrospective population study of the etiology of disease. 
In its detailed description of methodology, thorough search for possible biases in the 
data that might affect the interpretation of the results, the examination of the effect 
of choosing different control or comparison groups, the exposition of the different 
conclusions reached by alternative methods of analysis, and the competent and 
critical handling of the data, this study sets a high standard for future investigators. 


State Income Differentials, 1919-1954. Frank A. Hanna. Durham: Duke University 
Press, 1959. Pp. xix, 268. $7.50. 


Ernest W. Grove, U. 8. Department of Agriculture 


To final report of the Study of Differences in State Per Capita Incomes, financed 
jointly by Duke University and the Rockefeller Foundation, this volume is an 
intensive survey and detailed statistical analysis of the income data available for 
States. Differences among States, and changes in those differences, are studied in 
terms of the official Department of Commerce estimates of State personal income 
back to 1929 and the 1919-21 estimates prepared by Maurice Levin for the National 
Bureau of Economic Research. 

Time series analysis, attempting to separate cyclical and secular from random 
elements in income differentials, is supplemented by cross-section analysis based on 
1949 income data obtained in the 1950 Census of Population and earnings data for 
1947 and 1948 from the Censuses of Manufactures and Business. The cross-section 
analyses concentrate on “explanatory” variables associated with State differences in 
per capita incomes, principally occupation and industry. 

Much of the material in this volume has appeared in earlier publications, notably 
in the volume on “Regional Income”! reporting the proceedings of the June 1955 
meeting of the Conference on Research in Income and Wealth. There is some new 





1 Volume 21 of Studies in Income and Wealth, National Bureau of Economic Research, Princeton University 
Press, 1957. 
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material, however, and it was worth while to bring it all together in this final report 
comprising 7 chapters and 3 appendices. 

Chapter 1 is an excellent summary, not too brief, for the benefit of readers who do 
not wish to follow all of the detailed presentations in subsequent chapters. Chapter 
2 analyzes the “Patterns of Differential Change” in State per captia incomes. The 
rank order of States has not changed very much since 1919, but there has been a 
pronounced narrowing of relative interstate differentials as measured by coefficients 
of variation. These coefficients tend to move contracyclically. In other words, as 
national income per capita increases, its relative interstate dispersion tends to decrease, 
and vice versa. Professor Hanna calls this the “accordion effect.” 

Some regression of the State per capita incomes toward their mean as income 
increases is indicated. Hanna proceeds to the measurement of these regression lines 
in very great detail. Extensive work based on the old State income payment series 
developed regression analyses of the changes in State per capita incomes between all 
possible pairs of years from 1929 to 1949, and between each of the years 1919-21 
paired with each other and with 1929, 1933, 1939, and 1949. This formidable statistical 
undertaking was supplemented by some analyses along the same line with the newer 
personal income data for 1929-54. These intertemporal regressions are the weakest 
element in the whole volume, for their results are not onlv inconclusive (e.g., as 
to linearity), but also lacking in any real significance. Some experimentation and 
summary presentation of the results would have been justified, but the details in 
Appendix A on “Regression and Differential Change” might well have been omitted.? 

Chapter 3, “Cyclical Behavior of Differentials,” develops “sensitivity” and “time- 
factor” indexes for each State, designed to measure cyclical and secular changes, 
respectively. Because farm income is irregular in its behavior—actually negative in 
some States for some years in the 1930’s—the indexes were computed separately for 
farm and nonfarm income as well as the total. The results indicate that lower income 
States are generally more sensitive to cyclical changes than higher income States. 
They also have more and bigger upward trends, as indicated by the time-factor in- 
dexes. 

Chapter 4 subjects the per capita “Income Components”—labor income, property 
income, transfer payments, farm proprietors’ income, and nonfarm proprietors’ in- 
come—to the same type of analysis. The “accordion effect” is found for all compo- 
nents except farm proprietors’ income, which was also the only component to show an 
increase in relative interstate dispersion from 1929-54. It is concluded that the down- 
ward trend in the relative interstate dispersion of total personal income per capita is 
due in large measure to downward trends in the relative size and interstate dispersion 
of property income. Appendix C contains the detailed State estimates of the per capita 
income components. 

Chapters 5 and 6 deal with “Occupational Earnings” and “Industrial Earnings,” 
respectively. In each case, a “rate-constant earning” is computed for each State, the 
national average occupational or industrial earnings rates being applied to the occupa- 
tional or industrial distribution of the individual State. In other words, each State is 
“standardized” for earnings rates, yielding an average rate whose deviation from the 
national average is a reflection of the State’s deviation from the national average in- 
dustrial or occupational composition. In both cases, rate-constant earnings above the 
national average are mostly in the Middle Atlantic and East North Central regions, 
whereas those below the national average are mostly in the South. 

There is a tendency for States with unfavorable occupational or industrial composi- 
tions to have earnings rates below the national averages for their industries and occu- 





* As Edward F. Denison suggested 5 years ago (ibid., p. 175). 
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pations; and conversely, there is a tendency for States with favorable compositions to 
have earnings rates above the national averages. This correlation, unfortunately, 
makes impossible any exact assessment of the relative importance of rates and com- 
position in accounting for interstate differences. Hanna does the best he can in the 
face of this difficulty, and devotes Appendix B to a general discussion of the problem 
of standardization. 

In Chapter 7, “Demographic Factors,” several population variants, all more closely 
associated with income production than is total population, were substituted as de- 
nominators in computing per capita income. States with larger-than-average percent- 
ages of their population of working age are in the Northeast or on the Pacific Coast. 
Those with larger percentages under or over working age are in the Southern and 
Mountain regions. 

This study was needed. It was well conceived, and its execution has been conscien- 
tious and painstaking. The statistical techniques employed are unexceptionable, and 
they have been applied with great diligence. The original claim staked out by Pro- 
fessor Hanna was an important one, and he has worked it thoroughly. Although the 
ore proved out at a lower grade than was originally expected, much good metal was 
produced nevertheless. 

The results of this study will certainly be of value to State planning and develop- 
ment commissions, and perhaps also to local agencies concerned with urban develop- 
ment. Students of economic growth may find some new grist for their presently 
fashionable mill, but the results generally seem a little disappointing on this score. 
Familiarity with the results from previous publications may be partly responsible. 
But mainly it is due to the relatively short period covered by the study, and to the 
fact that generally rising incomes in the last 20 years have made it difficult to separate 
cyclical from secular forces affecting State per capita incomes. Considering the limita- 
tions inherent in the study, its results are generally quite reasonable—and not un- 
expected. But the author cannot be blamed for the absence of any startling conclu- 
sions. 


Secret Detention by the Chicago Police. American Civil Liberties Union, Illinois Division, 
Glencoe, Illinois: The Free Press, 1959. Pp. 47, $1.00 (paper). 


Dante Guaser, University of Illinois 


are pamphlet reports a tabulation of time between arrest and booking for a sample 
of cases tried in Chicago’s Municipal Court. The law requires that arrestees be 
brought before a magistrate ‘without unnecessary delay,” but the study found that 
in the nine branches of the court on which it reports, 25 per cent were held over 17 
hours, and 2.5 per cent over 48 hours. In one branch, 50 per cent were held over 17 
hours and 9 per cent over 48 hours. This conclusion is based on time of arrest and of 
booking entered by police on their Arrest Slips, but omits a third of the slips, in which 
the police left these sections blank. Furthermore, the period between arrest and book- 
ing may constitute only the first of several “unnecessary delays” in bringing an ar- 
restee to court. 

Sampling for this research was initiated by a lawyer who pulled ten or twenty files 
from each of several cabinet drawers. Statisticians Kruskal and Meier of the Univer- 
sity of Chicago were consulted later. They contributed appendices describing and ap- 
praising the sampling procedures, and estimating confidence limits on the assumption 
that the data approximate a probability sample. Two possible types of bias may be of 
interest. First, of course, the effort to sample mechanically all parts of the file drawer 
means that thicker files have more chance of being drawn. Secondly, personal bias of 
the tabulators for thicker or thinner files may develop, for example, if the thinner files 
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material, however, and it was worth while to bring it all together in this final report 
comprising 7 chapters and 3 appendices. 

Chapter 1 is an excellent summary, not too brief, for the benefit of readers who do 
not wish to follow all of the detailed presentations in subsequent chapters. Chapter 
2 analyzes the “Patterns of Differential Change” in State per captia incomes. The 
rank order of States has not changed very much since 1919, but there has been a 
pronounced narrowing of relative interstate differentials as measured by coefficients 
of variation. These coefficients tend to move contracyclically. In other words, as 
national income per capita increases, its relative interstate dispersion tends to decrease, 
and vice versa. Professor Hanna calls this the “accordion effect.” 

Some regression of the State per capita incomes toward their mean as income 
increases is indicated. Hanna proceeds to the measurement of these regression lines 
in very great detail. Extensive work based on the old State income payment series 
developed regression analyses of the changes in State per capita incomes between all 
possible pairs of years from 1929 to 1949, and between each of the years 1919-21 
paired with each other and with 1929, 1933, 1939, and 1949. This formidable statistical 
undertaking was supplemented by some analyses along the same line with the newer 
personal income data for 1929-54. These intertemporal regressions are the weakest 
element in the whole volume, for their results are not only inconclusive (e.g., as 
to linearity), but also lacking in any real significance. Some experimentation and 
summary presentation of the results would have been justified, but the details in 
Appendix A on “Regression and Differential Change” might well have been omitted.” 

Chapter 3, “Cyclical Behavior of Differentials,” develops “sensitivity” and “time- 
factor” indexes for each State, designed to measure cyclical and secular changes, 
respectively. Because farm income is irregular in its behavior—actually negative in 
some States for some years in the 1930’s—the indexes were computed separately for 
farm and nonfarm income as well as the total. The results indicate that lower income 
States are generally more sensitive to cyclical changes than higher income States. 
They also have more and bigger upward trends, as indicated by the time-factor in- 
dexes. 

Chapter 4 subjects the per capita “Income Components”—labor income, property 
income, transfer payments, farm proprietors’ income, and nonfarm proprietors’ in- 
come—to the same type of analysis. The “accordion effect” is found for all compo- 
nents except farm proprietors’ income, which was also the only component to show an 
increase in relative interstate dispersion from 1929-54. It is concluded that the down- 
ward trend in the relative interstate dispersion of total personal income per capita is 
due in large measure to downward trends in the relative size and interstate dispersion 
of property income. Appendix C contains the detailed State estimates of the per capita 
income components. 

Chapters 5 and 6 deal with “Occupational Earnings” and “Industrial Earnings,” 
respectively. In each case, a “rate-constant earning” is computed for each State, the 
national average occupational or industrial earnings rates being applied to the occupa- 
tional or industrial distribution of the individual State. In other words, each State is 
“standardized” for earnings rates, yielding an average rate whose deviation from the 
national average is a reflection of the State’s deviation from the national average in- 
dustrial or occupational composition. In both cases, rate-constant earnings above the 
national average are mostly in the Middle Atlantic and East North Central regions, 
whereas those below the national average are mostly in the South. 

There is a tendency for States with unfavorable occupational or industrial composi- 
tions to have earnings rates below the national averages for their industries and occu- 
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pations; and conversely, there is a tendency for States with favorable compositions to 
have earnings rates above the national averages. This correlation, unfortunately, 
makes impossible any exact assessment of the relative importance of rates and com- 
position in accounting for interstate differences. Hanna does the best he can in the 
face of this difficulty, and devotes Appendix B to a general discussion of the problem 
of standardization. 

In Chapter 7, “Demographic Factors,” several population variants, all more closely 
associated with income production than is total population, were substituted as de- 
nominators in computing per capita income. States with larger-than-average percent- 
ages of their population of working age are in the Northeast or on the Pacific Coast. 
Those with larger percentages under or over working age are in the Southern and 
Mountain regions. 

This study was needed. It was well conceived, and its execution has been conscien- 
tious and painstaking. The statistical techniques employed are unexceptionable, and 
they have been applied with great diligence. The original claim staked out by Pro- 
fessor Hanna was an important one, and he has worked it thoroughly. Although the 
ore proved out at a lower grade than was originally expected, much good metal was 
produced nevertheless. 

The results of this study will certainly be of value to State planning and develop- 
ment commissions, and perhaps also to local agencies concerned with urban develop- 
ment. Students of economic growth may find some new grist for their presently 
fashionable mill, but the results generally seem a little disappointing on this score. 
Familiarity with the results from previous publications may be partly responsible. 
But mainly it is due to the relatively short period covered by the study, and to the 
fact that generally rising incomes in the last 20 years have made it difficult to separate 
cyclical from secular forces affecting State per capita incomes. Considering the limita- 
tions inherent in the study, its results are generally quite reasonable—and not un- 
expected. But the author cannot be blamed for the absence of any startling conclu- 
sions. 


Secret Detention by the Chicago Police. American Civil Liberties Union, Illinois Division, 
Glencoe, Illinois: The Free Press, 1959. Pp. 47, $1.00 (paper). 


Dante Guaser, University of Illinois 


Tt pamphlet reports a tabulation of time between arrest and booking for a sample 
of cases tried in Chicago’s Municipal Court. The law requires that arrestees be 
brought before a magistrate “without unnecessary delay,”’ but the study found that 
in the nine branches of the court on which it reports, 25 per cent were held over 17 
hours, and 2.5 per cent over 48 hours. In one branch, 50 per cent were held over 17 
hours and 9 per cent over 48 hours. This conclusion is based on time of arrest and of 
booking entered by police on their Arrest Slips, but omits a third of the slips, in which 
the police left these sections blank. Furthermore, the period between arrest and book- 
ing may constitute only the first of several “unnecessary delays” in bringing an ar- 
restee to court. 

Sampling for this research was initiated by a lawyer who pulled ten or twenty files 
from each of several cabinet drawers. Statisticians Kruskal and Meier of the Univer- 
sity of Chicago were consulted later. They contributed appendices describing and ap- 
praising the sampling procedures, and estimating confidence limits on the assumption 
that the data approximate a probability sample. Two possible types of bias may be of 
interest. First, of course, the effort to sample mechanically all parts of the file drawer 
means that thicker files have more chance of being drawn. Secondly, personal bias of 
the tabulators for thicker or thinner files may develop, for example, if the thinner files 








590 AMERICAN STATISTICAL ASSOCIATION JOURNAL, SEPTEMBER 1960 


yield the documents sought more readily, or if they are more often completed in the 
thicker files. 

This is only the most recent of a long series of such statistical “exposures” of the gap 
between prescription and practice in American law. The consistency of findings in 
various jurisdictions suggest that similar illegal practices can be found today in most 
American cities, and regardless of laws excluding illegally obtained evidence, which 
this pamphlet recommends. This suggests the futility of “crying with alarm,” rather 
than extending the research to expose correlates of the defects, which might suggest 
remedies. 

That the latter possibility was overlooked in this study is suggested by the last 
Appendix, which reports some findings on eleven branches of the court in which 352 
cases were sampled, but which were excluded in favor of concentrating on 2,038 cases 
from nine other branches. Less than five per cent of the excluded cases were delayed 
over 17 hours, and only nine per cent of these cases had no time information. Five of 
the excluded branches, from which 132 cases were drawn, had no delay over 17 hours. 
It may well be that a much greater contribution to the reduction of unnecessary de- 
lay might have been made through contrast of the branches to which one might “point 
with pride” with those branches which yielded the findings which the pamphlet de- 
plores. Within the branches studied, a comparison of the delayed with the non-delayed 
cases would have been of interest. 

Social action organizations can contribute to social science by pointing out problems 
with which the public is concerned. They generate an often necessary heat. But social 
scientists must restudy the problems, and be less exclusively concerned with raising 
temperatures, if there is to be light. 


A Manual for Economic and Functional Classification of Government Transactions- 
United Nations, Department of Economics and Social Affairs. New York: Columbia 
University Press, 1958. Pp. x, 188. $2.00. 


Nancy D. RUGGLES, Yale University 


HE United Nations Manual for Economic and Functional Classification of Govern- 

ment Transactions is the result of a gradual evolution of work on this subject in the 
Fiscal and Financial Branch of the United Nations. As early as 1951 the United Na- 
tions had published a study entitled Budgetary Structure and Classification of Govern- 
ment Accounts (Sales No. 1951.X V1.3). The purpose of the manual under review is to 
develop classification schemes which will provide meaningful data on government 
transactions. It is unfortunately true that despite the central role of government in 
most economies today the traditional forms of public sector accounts do not provide 
the kinds of information required for making economic and social policy decisions. 
The traditional appropriation accounts are concerned with providing funds to specific 
administrative units within the government, and these requests are usually cast in 
terms of the object of expenditure (such as wages, travel, buildings, etc.) rather than 
the purpose or function of the expenditures. Furthermore, they are usually prepared 
at an extremely detailed level, without the sorts of summarization that are useful for 
policy purposes. On the other hand, the government sector in the national income 
accounts is also grossly inadequate for policy-making purposes. Although economic 
classifications are generally used in the national economic accounts, the degree of con- 
solidation is such that only the very broad outlines of government activity are re- 
vealed. Furthermore, as in the appropriation accounts, little or no information is pro- 
vided on the purpose or function of the various kinds of government activities. The 
United Nations Manual attempts to remedy this situation by setting forth a system 
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of tables into which the basic government accounts can be reclassified. Since the in- 
troduction of the Manual, a number of countries have undertaken reclassification 
projects along the lines suggested in the Manual. 

The Manual is primarily concerned with the development of procedures for obtain- 
ing an economic classification of government expenditures. For this purpose four types 
of governmental entities are distinguished. These are (1) government trading enter- 
prises, which produce goods and services for sale to the general public but are obliged 
to surrender any surplus they may earn to the general government funds; (2) ancillary 
agencies, which are organized to supply goods and services for use of individual 
government agencies but which do not provide services to the public; (3) government 
financial enterprises, which are organized for purchase or sale of financial claims, issu- 
ing domestic currency, foreign exchange, etc.; and (4) general government agencies, 
which are financed primarily by specific appropriations from general government 
funds. For each of these types of agency, on both the central and the local levels, the 
Manual sets up a system of economic classification of transactions, distinguishing such 
flows as current expenditures on goods and services, transfers, subsidies, interest on 
government debt, etc. Considerable attention is given to setting up sets of accounts 
which are appropriate to the different types of government agencies, and to the prin- 
ciples of consolidating the accounts for individual sectors into a general account for 
the government as a whole. 

Secondarily, the Manual is concerned with the principles of functional classification 
of government expenditures. A specific functional classification scheme is proposed, in 
which the main categories are general services, community services, social services, 
economic services, and unallocable expenditures. General services include such things 
as general administration, defense, justice, and police. Community services cover 
roads, fire protection, and other community services. Social services cover education, 
health, social security, housing, and recreation. Economic services cover aids to agri- 
culture, fuel and power, mineral resources, transportation, and similar services. Un- 
allocable expenditures include such things as interest on the debt, inter-governmental 
transfers, and foreign economic aid. The functional classifications are linked with the 
economic classifications, providing a cross classification for the government sector as a 
whole of expenditures by both economic character and function. 

Finally, a chapter is devoted to the reconciliation of the government sector accounts 
as shown in the Manual with the government sector in the United Nations system of 
national income accounts (A System of National Accounts and Supporting Tables, UN 
Sales No. 1952.X VII.4). Considerable difficulty arises in this endeavor, however, inas- 
much as the United Nations system of national income accounts was not designed 
with any thought of integration with any other form of economic accounts. The at- 
tempt at integration is, therefore, somewhat strained and complex. The government 
sector accounts themselves can serve a useful function in providing information for 
policy makers, but it seems doubtful whether this function would be enhanced by 
this kind of reconciliation with the national income accounts. There is unquestionably 
a need for increased attention to the integration of all of the forms of national eco- 
nomic accounting, but the need goes far deeper than the provision of complex recon- 
ciliation formulae for basically dissimilar accounting systems. It might therefore be 
far better not to force the reconciliation of the government sector accounts with the 
national income accounts at this point, but rather at some juncture to reconsider the 
redesign of the national income accounts themselves so that they can be logically 
integrated not only with policy oriented public sector accounts, but with such other 
forms of economic accounting as input-output and flow of funds accounts. 
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The Measurement of National Wealth, Income and Wealth Series VIII. Raymond Gold- 
smith and Christopher Saunders, Eds. London: Bowes and Bowes, 1959. Pp. xiv, 389. 45s. 


Morpecual E. Krernin, Michigan State University 


LOSELY related to the development of aggregate economic analysis was the intro- 

duction and widespread adoption of national income accounting. While these ac- 
counts proved extremely useful in filling the data requirements for the Keynesian ana- 
lytical framework, it is becoming increasingly apparent that they must be supple- 
mented by other sets of accounts such as estimates of national wealth. Not only do 
these estimates serve as an essential guide for policy measures directed at short-run 
economic fluctuations; they are also indispensable for a thorough understanding of the 
long-run process of economic growth. 

The volume under review represents a compilation of national wealth data in a 
variety of countries. The first chapter, a summary survey by Th. D. Van Der Weide, 
contains statistics on national wealth for eighteen nations from five continents. For 
each economy it provides the structure of national wealth at a recent time and some 
capital-output ratios over time. No attempt is made to analyze the data, the task of 
interpretation being left to chapters 4 through 13,! each of which is devoted to an indi- 
vidual country or region. The chapters are written by experts closely associated with 
the development of the accounts in their respective countries. (Incidentally, it would 
have been useful to know the professional association of the authors.) In order of ap- 
pearance, these countries are: Norway, the Netherlands, Germany, Yugoslavia, 
Canada, Latin America, Argentina, South Africa, Australia and India. Although the 
amount of data available varies greatly from one country to another, all the chapters 
have certain elements in common. 

First they all explore in detail the methodological problems involved in compiling 
the estimates, such as sources of data and methods of estimation. There are two basic 
methods for estimating the stock of capital assets. “The first is the ‘perpetual inven- 
tory method’ consisting of the cumulation of statistics of gross capital expenditures 
over time, making allowance for depreciation or disappearance of old assets and for 
‘changes in prices” (p. 4). This procedure requires: a) estimates of capital expenditures 
over long periods in the past; b) some basis for estimating the length of life of capital 
goods; and c) price series for capital goods. The second method is a direct estimate of 
the stock of assets in use at a given time, derived from official censuses, fire insurance 
valuations, the balance sheets of enterprises, capitalization of income, and tax statis- 
tics. The method used in any particular instance depends largely on the types of infor- 
mation available. 

Second, all chapters present estimates of aggregate national wealth and its composi- 
tion. For many of the countries these estimates go back over a protracted period of 
time. Although the introductory chapter suggests a uniform scheme for presenting the 
composition of wealth (p. 4),? there are great variations in the manner of presentation 
followed by individual authors. Third, most chapters contain capital-output ratios 
for the entire economy either over time or at a given time. 

While the mere compilation of these statistics is in itself of great value, the general 
economist would be more interested in interpretation of the data. In that respect the 





1 Chapters 2 and 3 are devoted to methods of measuring capital and to fi ial ting, respectively. 
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chapters on Norway (4) and Latin America (9) are probably the most outstanding. 

After presenting the wealth and income estimates for Norway for the first half of 
the 19th century, the authors (O. Aukrust and J. Bjerke) compute marginal capital- 
output ratios for seven sub-periods. From an approximate level of 3 during 1900-1939, 
the capital-output ratio dropped in World War II, but rose sharply after 1948 reach- 
ing 5.5 in 1951-56. These movements are readily explained in terms of shifts in em- 
ployment and changes in the marginal productivity of capital. A Cobb-Douglas pro- 
duction function is subsequently fitted to the data for 1900-1929 and 1946-1955 with 
the following results: 


Ye = 2.262- 9-208. [,,9-763. ¢0.01818 


“(1) A partial increase in the volume of real capita! by 1 per cent will, ceteris pari- 
bus, raise the national product by 0.2 per cent. (2) A partial increase of labor by 1 per 
cent will raise the national product by 0.76 per cent. (3) With constant capital volume 
and constant employment the national product will, as a result of gradually improving 
‘techniques,’ increase at the rate of approximately 1.8 per cent per annum” (p. 106). 

Although this formulation is subject to some important limitations (discussed on 
p. 109), several inferences can nonetheless be drawn from it. The role of capital in in- 
creasing production appears to be considerably smaller than is generally assumed. 
“The rate of growth of the national product is affected comparatively little by the 
level of investment. .. . Whether the rate of investment within reasonable limits is 
high or low, the national product with constant employment will rise by 2-3 per cent 
per annum, largely because the technical factor alone automatically warrants a 
growth which has been estimated roughly 1.8 per cent® per annum.” One implication 
of this finding is that in an economy like Norway, it is not necessarily desirable to pur- 
sue a stringent policy designed to stimulate high level of investments. Better results 
may perhaps be achieved by diverting resources from investments in reproducible 
capital to investments in human resources such as education and research. 

In the case of Latin America (chapter written by A. Ganz), the analysis is focused 
on the behavior of the average capital-output ratio over time. The decline of that 
ratio between 1925 and 1948 reflected a continuous increase in the productivity of 
capital associated with the process of development, industrialization and better util- 
ization of industrial capacity. Since 1948, however, the ratio remained about constant. 
Factors such as the inefficient use of land resources and misutilization of public in- 
vestment resources (p. 234) were found responsible for halting the increased produc- 
tivity of capital. Following the general analysis of Latin America, economic develop- 
ment in Argentina and Colombia are contrasted and explained in terms of the com- 
position of investments and capital. 

Other chapters are also not lacking in analytical insights. In West Germany (Chap- 
ter written by F. Griinig) the decline in the capital-output ratio during the postwar 
period was a reflection of the increased importance of manufacturing as compared 
with transport and housing and of the increasing rationalization of the production 
process. The studies of South Africa (authored by D. Franzsen and J. Willers) and 
Australia (by J. Garland and R. Goldsmith) use the accumulation of national wealth 
as a focal point in explaining economic development during the past half a century. 
As may be expected, a major role in the development process is assigned to changes in 
the terms of trade. 

On the basis of the data gathered thus far Mr. Ivo Vinski, author of the chapter on 
Yugoslavia, suggests the following tentative conclusion about the order of magnitude 
of the capital-income ratio in various stages of economic growth (p. 165): 





4 Note that one limitation of the analysis is that it the “technique” factor to be independent of the 
capital factor. 
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“1) In under-developed countries this ratio seems to be low, probably under 2.5, 
because labor is the main factor of production in these countries. . . . 2) In countries 
undergoing the process of accelerated basic industrialization the ratio may rise to 
over 4.0....3) In what are now advanced countries the ratio generally ranges be- 
tween 2.5 and 3.5, the decline being caused by a more economical utilization of 
capital, increase of productivity, inventions, etc.” 


The foregoing paragraphs illustrate some uses to which estimates of national wealth 
can be put, especially when they correspond well to national income data. Attempts to 
explain the historical behavior of measures such as the capital-output ratio may lead 
not only to better understanding of the development process, but also to important 
policy conclusions in the fields of development and resource allocation. The volume 
under review is therefore a welcome addition to the growing literature on the subject. 


Studies in Company Finance, A Symposium on the Economic Analysis and Interpretation 
of British Company Accounts. Eds., Brian Tew and R. F. Henderson. Cambridge, Eng- 
land: Cambridge University Press, 1959. Pp. xix, 301. $6.50. 


Epwin Kun, Massachusetts Instiiute of Technology 


H1s book is a collection of essays analyzing balance sheets and balance sheet data 

for about 2500 to 3000 firms whose securities are traded on the British Stock Ex- 
change and whose products fall within the broad categories of manufacturing, build- 
ing, or distribution for the years 1949-53. The essays have one unifying theme—all 
are analyses of a homogeneous body of data, either sources and uses of funds state- 
ments or balance sheets, which have been put into standardized accounting frame- 
work so that frequency distributions, principally in the form of ratios, will not be dis- 
torted by capricious inter-firm accounting differences. This approach represents an 
interesting and potentially extremely fruitful collaboration between the British 
Government and a private research organization which might well be emulated in 
this country. 

A central tenet is well expressed by the editors in the introductory essay: “We have 
looked upon ourselves as pioneers exploiting a new and rich mine of economic data, 
and like pioneers have not hesitated to exploit first the seam which appeared easiest to 
work.” Before pointing to some limitations of this particular approach, it seems best 
to indicate some of the benefits by reporting the main findings, many of which are of 
very real interest. 

In a discussion of “Dividend Policy and Income Appropriation,” S. J. Prais shows 
that for all firms in the United Kingdom, sharp tax increases in the postwar period fell 
upon dividends instead of retained earnings. Dividends fell from about 70% of income 
in 1938 to 30-35% in 1948-56. Taxation rose from about 15% to 40% of net income 
before taxes over this period while “net savings” rose from 18% to an average of about 
25%. Frequency distributions of retained earnings to net income show that 70% of the 
companies saved between 40% and 80% of their net income. When the same saving- 
income ratio is distributed according to the rate of growth, it is found that the faster a 
firm’s rate of growth, the higher will be its average saving-income ratio. 

In an article of major interest to economists, Brian Tew discusses corporate self- 
financing. Net investment is defined as the increase in net total assets over the period 
1949-53, and corporate saving is defined as net retained earnings, i.e., net earnings 
less dividends. While some small attention is paid to the role of current depreciation 
expense, it is largely in another connection. In my opinion, this is unfortunate because 
the subsequent findings are heavily qualified by this conceptual problem. The ratio of 
saving to investment is positively correlated with the savings-income ratio; the ratio 
of savings to investment is inversely (and strongly) correlated with the ratio of income 
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to investment. Most interesting, they find that the correlation between the log of the 
savings-income ratio and the log of the investment-income ratio is ‘remarkably low, 
only 0.15... .” It appears, then, that the cross-section correlation between savings 
and investment, both divided by corporate income, is very small. Tew also considers 
other internal sources of finance including the change in net liquid position which un- 
fortunately is defined as liquid assets, i.e., cash, plus tax reserve certificates, plus 
marketable securities, minus bank credit outstanding and current account accrual 
such as for dividends, interest, and taxes. Repeated use of net accounts, when ob- 
served data may often be attributable to diverse underlying behavior appears to be a 
shortcoming common to much of the anatyain, although clearly opinions on this score 
can differ quite legitimately. 

R. F. Henderson develops evidence on a clearly defined relation between the rate of 
growth of net assets and new issues. He also reports for the aggregates that the large 
companies financed a larger proportion of growth by new issues than did small com- 
panies and placed particular reliance upon common stock issues. Bank credit was of 
trivial importance in financing the growth of British firms as a whole, although it was 
of real importance to certain classes of companies. During this period, small firms did 
not rely disproportionately on net trade credit (i.e., accounts receivable less accounts 
payable), although the fastest growing companies found trade credit to be a use rather 
than a source of funds. Trade credit is primarily used in a fairly rigid relation to out- 
put and is seldom an important fund source for firms who are short of funds, although 
a severe depression could easily alter the observed pattern. 

Prais in another essay shows that the hundred largest companies increased their 
share of gross trading profit (net income plus depreciation) from about 25.2% in 1949 
to 31.5% in 1955. In every year large company profits either increased more rapidly 
or declined less than for all other companies. Studies using these data for individual 
industries (brewing, cotton cloth, construction, construction materials, electrical 
goods and machinery, and retail distribution) are also included. 

The advantages to different analyses of the sam2 body of data are considerable and 
hardly require further elaboration. The disadvantages (and these for the most part 
have been recognized by the authors) should not be overlooked. First, by ignoring 
supplementary information available in greater detail for a sub-set of firms, the in- 
vestigators perforce sacrifice a good deal of information in pursuit of homogeneity. 
Second, the economies of scale and improvements in quality which accrue when a 
common body of data has been processed according to standard procedures is to some 
extent offset by the fact that the motivation of the investigation is imprecise. 

This study represents a desirable tendency to make available homogeneous and 
relevant data where none existed before. Over a number of years this approach will 
yield great benefits since later investigators with more sharply defined objectives will 
not be strapped by paucity of data. This indeed represents a major advance, particu- 
larly in the age of the high speed electronic computer, when data processing is no 
longer prohibitively expensive. 


A General Theory of the Price Level, Output, Income Distribution, and Economic 
Growth. Sidney Weintraub. Philadelphia: Chilton Company, 1959. Pp. 123. $3.50. 


Hans Brems, University of Illinois 


F N=employment, P = the price level, Q = physical output, and w= the money wage, 
Weintraub finds that the ratio PQ/wN has remained relatively constant during the 
last thirty years. The ratio is called k, and its constancy is called “the law of k.” The 
demonstration of the constancy of k occupies the first half of the book. The demon- 
stration uses extremely primitive tools, i.e., time series for k. The burden of the proof 
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is carried by two diagrams showing k itself as well as an index of k as functions of time. 
Why both? In effect, looking for constancy of the average k, rather than the marginal 
k, implies that a straight line through zero in a PQ-wN diagram represents the best fit. 
Why not a simple regression line with a nonzero intercept? It would have been less re- 
strictive and might have given a better fit. But Weintraub uses no regressions at all. 

The relative constancy of k has been observed by many writers, and it has been 
interpreted in terms of economic behavior before. In Lawrence R. Klein, A Textbook 
of Econometrics, pp. 75-6, for example, one finds a lucid demonstration that if the 
production function is of the Cobb-Douglas type, if markets are purely competitive, 
and if entrepreneurs maximize their profits, then k will be equal to the reciprocal of the 
elasticity of output with respect to employment. In a Cobb-Douglas production fune- 
tion, elasticity is a constant. Weintraub mentions none of these things but displays a 
tremendous enthusiasm about the “magic” constant k. He proceeds to turn his defini- 
tion PQ/wN =k into the behavior hypothesis 


N 
(2.1) P = kw Q 

To do this, he must clearly assume that P is a dependent variable and that w and 
N/Q are both independent variables. As we shall see later, these assumptions are 
crucial for his policy conclusions. 

On page 69 Weintraub is ready for what he calls “the wedding ceremony,” i. e., 
the inclusion of the law of the “magic” k into a macroeconomic model. The ceremony 
is performed within a 6-page chapter which is called “the only chapter that is directed 
primarily to specialists in economics.” The specialist, I am afraid, will benefit no more 
from this chapter than he does from the others. A Keynesian-type macroeconomic 
model of nine equations is set up and is said to determine “nine” unknowns. Unfor- 
tunately, the list of unknowns contains eleven items. Things are not quite as bad as 
the casual reader may fear. It turns out that k has been included among the un- 
knowns; and k is surely the queen of the book’s parameters. So there is one unknown 
too many. Furthermore, there is a verbal afterthought to the effect that the supply of 
money M must be equal to the demand for money L. This provides the unwritten 
tenth equation. The list is still quite confusing, however, for N (employment) and Q 
(physical output) are not included in terms of their absolute values but merely in the 
form of the ratio Q/N. Weintraub cannot mean this literally: he wants to determine 
absolute employment and output. The confusion is heightened further by the use of 
R for the unit wage cost on pages 9 and 48 and the use of R for total profits on page 
72. But suppose the reader fights his way through the confusion, what does he find? 
He finds no discussion of the existence of a solution, let alone a discussion of its prop- 
erties it if exists. Nor is the model used for anything in the later chapters. Even if its 
formal deficiences are repaired the model seems to add nothing to the model presented 
so lucidly, verbally as well as mathematically, by Lawrence R. Klein in his Keynesian 
Revolution thirteen years ago. One of the many merits of the Klein model is the care 
with which it supplies the additional information needed to determine the absolute 
level of prices or wages in addition to what is needed to determine the system in terms 
of real quantities. 

What are the policy conclusions of Weintraub’s book? There are two of them. First, 
inflation being wage-push inflation, the Federal Reserve system cannot fight it with- 
out creating unemployment. Second, inflation being wage-push inflation it might be 
restrained if a Federal agency would collect data on it and show them to Congress. 
The mere threat of informed congressional action might well, Weintraub thinks, 
“counsel restraint.” Whatever one may think of the policy conclusions from the point 
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of view of policy, they do not follow from the analysis of the book. Constancy of k may 
mean that once w is set P follows, or it may mean that once P is set w follows. Wein- 
traub clearly favors the first interpretation, but he offers no reasons why he thinks it 
is superior; worse still, he does not ever seem to have been aware of the existence ui the 
second possibility. 

After chapters on income distribution and growth, reworking the “magic” k a few 
more times, the book ends in a peculiar Concluding Note. Here, new studies are pro- 
mised “to demonstrate that the price level has behaved over the longer past in the 
way our basic growth formula suggests.” And the closing sentence is this: “Consider- 
ing that the formulation descends from the basic truism, I think we are entitled to be 
confident of the results.” But if the formulation descends exclusively from a truism, 
then how could it not be true, and if so, why waste time testing it? On the other hand, 
if it also descends from some forebears other than the truism, how can we be confident? 
This methodological confusion is, I am afraid, typical of the entire book. 

In defense of the book one might say that since it was intended for the layman, no 
really powerful techniques of analysis, statistical or theoretical, could be expected to 
be used in it. Well and good, but in that case the author should not have permitted 
his publisher to make such wild claims as appear on the dust cover. Nor should he 
have permitted himself to make similar claims in the text, comparing his law of the 
“magic” k and its importance to that of Newton’s gravitational constant. 


How American Buying Habits Change. United States Department of Labor. Washington, 
D. C.: 1959. Pp. xiv, 253. $1.00. 


Eva Mvue.ter, University of Michigan 


HE purpose of this monograph is two-fold. One aim is to bring together the results 
Tor six major surveys of consumer expenditures undertaken between 1874-5 and 
1950. The first of these surveys was conducted by the Massachusetts Department of 
Labor, the other five by the U. 8. Bureau of Labor, later the U. 8. Department of 
Labor. Twenty-five detailed tables, some additional text tables, and nine charts are 
presented in an attempt “to make readily visible the main outiines of the several sur- 
veys.” This is not a source book, but a convenient summary of major data. 

The second aim of the monograph is to describe the improvements in living stand- 
ards which American workers have achieved since 1888. Unfortunately, this second 
task is approached only in part by making quantitative comparisons based on the ex- 
penditure surveys. In an apparent attempt to make the monograph interesting and 
lively for the general reader a great deal of miscellaneous descriptive information is 
added. There are quotations from case studies dealing with the living and working 
conditions of immigrants around 1900, accounts of changes in legislation pertaining to 
housing, sanitation, working hours, social insurance, etc. There also are discussions of 
major developments in public health, the growing importance of synthetic textiles 
and of detergents, labor saving devices around the house, the movement of working 
class families to the suburbs, and the like. One wonders whether the reader who is 
interested in the quantitative comparisons will see in the descriptive material any- 
thing but a tedious recital of well known facts. 

The long descriptive text also suffers from annoying exaggeration. When speaking 
of workers at the turn of the century, much emphasis is put on the conditions under 
which slum dwellers and immigrants were forced to live. When the authors turn to the 
1950’s we hear of wage earners who no longer live in an identifiable working class 
neighborhood, are officers of the PTA, go to the same dentist as the salaried groups, 
own air-conditioners, power lawnmowers, outboard motors, and home workshops. 
Practically nothing is reported about the more prosperous segment of the working 
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population around 1900, or about slum dwellers in 1950. The Puerto Rican immigrant 
of today and the worker who lives in his own air-conditioned house are worlds apart in 
their consumption levels. The problem of diversity within the wage earner group is 
buried under excessive emphasis on averages. 

The chapter headings suggest the scope as well as the tone of the volume. Chapter 
I, “The Bell and the Bay Window,” paints a picture of the great material advance of 
American workers without reference to the statistical data. Chapter II provides an 
over-view of changes in family income and expenditure patterns as revealed by the six 
expenditure surveys. The next seven chapters deal with particular areas of consump- 
tion: housing (“From the Slums to Suburbia”), food (Meals, Menus and Market 
Baskets”), household operation, clothing and personal care, health care, transporta- 
tion, and leisure. The last chapter compares the scope and method of all expenditure 
surveys of the Department of Labor. For the non-expert interested in the expenditure 
data or the development of survey techniques, this is a valuable summary; for the pro- 
fessional expert a recent monograph by Helen H. Lamale goes over the same ground in 
more detail. 

In summarizing the expenditure surveys, the monograph performs a distinct serv- 
ice. The material presented could be brought together by individual students of con- 
sumer economics only at the cost of considerable time spent in digging back into the 
original data. By contrast, the researcher interested, say, in exploring changes in 
housing conditions would hardly find the data presented sufficient; the monograph is 
not intended for him. The bulk of the statistical material consists of tables showing 
changes in the distribution of expenditures between categories of consumer goods 
(major and minor) over time and differences between income groups at a given point 
of time. Occasionally there are tables presenting the trend of deflated dollar amounts 
of expenditures over the years. No adjustments are attempted for changes in the 
population groups covered by the surveys, but the text contains some references to 
this problem. Break-downs by such variables as family size, region of the country, or 
occupation are rare, but appear where they are most relevant. There are, in addition, 
data on changes in quantities of various foods consumed; changes in number of vari- 
ous garments purchased by men, women, and children in different parts of the coun- 
try; changes in the distribution of medical care expenditures by type of expenditure; 
uses made of the car by different occupation groups. Finally, there is an interesting 
table which shows the change in standard family budgets over time (pp. 236-8). 

The scope of the monograph is to a large extent determined by the six expenditure 
surveys. Only sporadic use is made of data from other sources. For example, practi- 
cally nothing is said about amounts saved or forms of saving—probably because of the 
weakness of the savings data in the expenditure studies. No attempt is made to enrich 
the housing data by drawing on Census materials also. Advances in the economic wel- 
fare of U. 8. workers might have been placed in some perspective by making compari- 
sons with other nations or with the progress of other groups in the U. S.; but again, 
this would have required some presentation of “outside” data. The monograph depicts 
long term changes from one expenditure survey to the next; cyclical fluctuations in 
consumption patterns again fall beyond the scope of the data. In brief, this is a study 
to summarize a body of data rather than a study which searches for the factors which 
promote and give direction to changes in consumption. 

There is no point in criticizing the monograph for its limited purpose. The idea of 
writing economic history not in terms of changes in economic institutions and legisla- 
tion, but in terms of changes in the fortunes and behavior of consumers is challenging. 
The BLS monograph points up the need for economic histories that focus on people 
and on major analytical variables such as consumption, investment, or the impact of 
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the government sector. It is to be hoped that this monograph will stimulate other his- 
tories of consumption, less restricted in their use of data and more analytical in intent. 
To write such a book obviously is not the task of the U. 8S. Department of Labor. 


The Dynamics of Supply: Estimation of Farmers’ Response to Price. Marc Nerlove. 
Baltimore: Johns Hopkins Press, 1958. Pp. ix, 268. $5.00, Cloth; $4.00, Paper. 


Rosert L. Gustarson, Michigan State University 


HE economist who wishes to apply theory to fact is faced with the necessity of esti- 

mating the parameters of functional relationships, and is plagued by the inade- 
quacy of the available means by which to make such estimates. A large amount of re- 
search effort has been devoted to improving the means, and this study by Marc 
Nerlove constitutes a significant contribution to that effort. The general character- 
istics of this contribution are similar to others: (1) The basic notion is the fitting of 
hyper-planes to observed data or functions of observed data (in essence, some form of 
“least squares”); (2) Demonstration is made that if the fitting is done in some cur- 
rently customary way, the conclusions are likely to be erroneous; (3) More sophisti- 
cated methods of fitting are introduced, corresponding to what are thought to be cru- 
cial aspects of the problem at hand (in the present case, “price-expectation formation” 
and “lags in adjustment”) and making use of concepts from accepted statistical 
methodology (e.g., maximum likelihood); (4) Finally, and especially if some empirical 
applications are made (as they are in this case), it is pointed out that, while the new 
methods may offer some improvement over what was before available, they—perhaps 
inevitably—raise more questions than they answer. 

The book consists of (1) two chapters on “Theoretical Considerations,” the rele- 
vance of some of which does not appear until considerably later; (2) a chapter survey- 
ing some previous studies on agricultural supply functions; (3) four chapters essen- 
tially on the nature of the data; (4) two concluding chapters on “Statistical Analyses.” 
The three commodities whose supply functions are studied are corn, cotton, and 
wheat, in the U. 8., and the “dependent” variable in all cases is acreage, rather than 
the actual quantity of the commodity supplied. 

The book may be recommended, not only to those who have a substantive interest 
in its specific subject-matter, but also to research workers in general in economics and 
statistics, as a good example of (a) the adaptation of statistical methodology to a 
problem area of considerable general significance, and (b) the careful application of 
the methodology to specific data. Chapters V-VII are of interest primarily as a de- 
scription of some aspects of U. S. agriculture. 

In recent years a number of applications of essentially the kind of method which 
Nerlove refers to as the “non-iterative” have been made, both within and outside the 
field of agricultural economics, and in estimating relationships other than supply; in 
some cases with seemingly quite “good” results. The general approach would seem to 
be, at least conceptually or potentially, applicable in a variety of situations where “ex- 
pectations” and/or “lags” are relevant. The nature of Nerlove’s medel, in its simplest 
form, is as follows: The supply function is 


(1) Ze = Ao + aP,* + uu 


where 2; is acreage, a and a; are parameters, P,* is expected “normal” price (not ob- 
servable), and u; is the disturbance. The price-expectation relation is: 


(2) P,* > Py,* + b(Pivn — ran) 


where P;_, is actual price and 6 is the “ceefficient of expectation,” 0 <b <1. To estimate 
o, a, and b, two alternative procedures are used: 
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(1) The “iterative” method: Equation (2) is shown to imply that P,* is (approxi- 
mately) a weighted average of actual past prices, 


t 
P,* = § b(1 — b)** Py 

t=O 
where S denotes sum. This expression is substituted into (1), and least squares 
estimates (which are maximum likelihood under the usual kinds of assump- 
tions) are obtained by a numerical, iterative procedure. As is often the case in 
econometrics, little is known about the small-sample properties of the esti- 
mates. 
The “non-iterative” method: Equation (2) is substituted into (1), and the re- 
sult reduced to: 


Ze = bo + DiP oa + Dotin + % 
where 
bo = bao 
bi = bar 
b =1—b 
Mm = ue — (1 — db) 


The b,’s are estimated by least squares regression applied to (4), and the esti- 
mates of the parameters of interest then obtained from (5). 

The estimates of the elasticity of acreage with respect to expected price (at the 
mean values of the variables), by the iterative and non-iterative methods respectively, 
were: for cotton, 4.53 and 0.67; for wheat, 1.18 and 0.93; and for corn, 0.35 and 0.18. 
(These were based on the simplest model, similar to the one outlined above except 
that a trend term was included. The iterative method was not used on the equations 
which included more variables.) One of the main sources of difficulty is that if the u, 
are independent of each other, then the », cannot be, and vice versa. Use of weighted 
averages exacerbates the problems caused by serial correlation in the disturbances, 
since the averages (P;*) will tend to be serially correlated even if the original prices 
were not. (If the P; were randomly and independently distributed with constant vari- 
ance, then the serial correlation coefficient of the P,* in an infinite sample would be 
1—b.) Intuitively at least, any possible resulting bias would appear to be maximized 
when the weights (i.e., the value of b) are chosen not independently of the sample, but 
so as to maximize the correlation in the sample between z and P*. 

An alternative specification of the simplest model is 


(6) a* = ao + aiP,* + us 


where z;,* is “desired” or “equilibrium” acreage (not observable) ; 
(7) P,* = Py (i.e., b = 1) 


and 
(8) Le = Bea + c(a,* — 21) + we 


where c is the “coefficient of adjustment” and u; is a disturbance. Substituting (6) and 
(7) into (8) gives again (4), with the substitution of c for b in (5) and with v;=cuz+. 
L.e., the two alternative specifications lead to the same estimates (by the “non-itera- 
tive” method), but with different interpretations. 

The writer has not noticed any major “errors” in Nerlove’s study, but, of course, 
simplifying assumptions are necessarily made. Some omissions which might be worth 
keeping in mind when interpreting the results, or when contemplating further research 
in the area are: 


' 
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1) No attempt is made to consider the “simultaneous equations” problem, which 
may be important in the case of a commodity for which the quantity demanded, as 
well as the quantity supplied, depends on past prices. Of the three commodities con- 
sidered, corn, at least, would appear to have a demand function of this nature, since 
the level of production of hogs, and hence the demand for corn, is largely influenced by 
past hog and corn prices. (And it may be noted that Nerlove has the most trouble 
with his regressions for corn.) It is true that at least part of this particular difficulty 
may be obviated by the use of acreage rather than quantity supplied as the dependent 
variable. Even if this is the case, however, the discussion of stability of market equi- 
librium would likely be modified by consideration of lags on the demand side. 

2) Nerlove points out that if an adjustment lag is included in the model as well as a 
price-expectation coefficient not equal to one, then acreage lagged two years as well as 
acreage lagged one year appears in the “reduced” equation with which the estimates 
are made (in the case of the “non-iterative” method). His regressions of Chapter VIII, 
however, include only acreage lagged one year, and he points out (in Chapter IX) 
that the resulting estimates of the long-run elasticity of supply with respect to ex- 
pected price will generally be biased (if, in fact, both influences are operative). His con- 
clusion about the probable direction of the bias appears in error, however. Using his 
notation (footnote 1, page 241), he has E#,< m2, which is all right, but the relevant 
question is whether E#: < 2+ 73, and this can be true only if g.>1, which appears 
highly unlikely. (This is a relevant quibble, since one of the main conclusions of the 
study is that elasticities of supply are probably higher than they have sometimes been 
held to be.) Unfortunately, the attempt to introduce acreage lagged two years, and 
estimate both the expectation coefficient and the adjustment coefficient, in Chapter 
IX, met with little success. It might be mentioned that some other researchers, follow- 
ing Nerlove’s lead and making similar attempts, on different data, have had similarly 
little success. The evidence suggests that the model, as developed so far, is not ade- 
quate to cope with this kind of fairly simple generalization. 

3) “Expected normal price” is assumed to be completely determined by past actual 
prices and a constant coefficient of expectation, with no consideration of other possible 
influences. This seems to be a highly unrealistic restriction, and relaxation of it, at 
least to the extent of including a random disturbance in the price expectation relation 
(if better alternatives are not available) would seem to be worth considering in future 
applications or extensions of the model. It does not appear, however, that simply add- 
ing a disturbance term would, by itself, have any effect (under the usual kinds of 
assumptions) on the interpretation of Nerlove’s own results. 

The book contains a number of sometimes confusing typographical errors. 


The Postwar Rise of Mortgage Companies. Saul B. Klaman. New York: National 
Bureau of Economic Research, 1959. Pp. xv, 102. $1.50. 


Jutius Margouis, University of California, Berkeley 


HE mortgage company is a firm whose principal activity is originating and servic- 
Tine residential mortgage loans for institutional investors. It acts as an intermediary 
between the builder or home owner and the institutions with funds; it services the 
mortgage it originates; it relies on commercial bank credit to hold temporary inven- 
tories of mortgages. Its rapid rise during the postwar period reflects the shifts in the 
structure of the mortgage market which have given rise to the need of new specializa- 
tions. 

. Mr. Klaman has examined in detail the brief history of the postwar rise of the 
mortgage companies. During this period the companies experienced a steady and 
rapid rate of growth. The major question asked by the study is: Why have they 
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grown? His answer can be put succinctly. The companies are the brokers between the 
rapidly growing institutional suppliers of credit, especially the insurance companies, 
and the home builders. The credit suppliers are national while the home builders are 
local and often located in regions distant from the fund suppliers. The introduction 
of the federally underwritten mortgage created the credit instrument which per- 
mitted the easy flow of institutional funds to small local investments. The mortgage 
company creates the contacts among these two groups. Though the development of 
this service is of interest another aspect of the study is potentially far more interest- 
ing. 

Klaman presents a great deal of data about the mortgage company industry: the 


age distribution of companies, their sources of income, size classes, growth rates, asset , - 


structure, uses and sources of funds, geographic distribution and preduction char- 
acteristics. Some of the data were developed by adjustments of material in govern- 
ment files and others were developed from a special survey. The data do allow for 
testing of hypotheses about the growth of firms, economies of scale, adequacy of per- 
formance and other interesting questions which can be raised in an industry study. 
Klaman did not direct his attention to these questions. His analysis was focussed on 
an explanation of the development of the industry. His careful descriptive material 
can prove useful to others who might want to test hypotheses about the structure and 
performance of an industry. 


An Introduction to Electronic Data Processing. Roger Nett and Stanley A. Hetzler. 
Glencoe, Illinois: The Free Press, 1959. Pj. 287. $6.75. 


T. A. Yancey, University of Illinois 


HIS book is concerned with providing a comprehensive discussion of electronic data 
‘Keen systems for the non-mathematically oriented beginner in data process- 
ing. The authors have drawn on their experience obtained in organizing and teaching 
in the data processing training program of the Air Force’s Air Material Command to 
determine the appropriate material to include and the emphasis to be given each as- 
pect of data processing. As a result, the book reads very much like a series of lectures 
might sound when delivered by a good expositor whe enjoys teaching. 

In the first chapter, the authors trace the history and development of automation, 
the rise of the scientific method, and the history and development of computers from 
the abacus to the modern general purpose automatic computer. This is done in an 
interesting if somewhat casual way. Having established the evolution of automation 
and computers in general, analog and digital computers are introduced, and their 
basic differences and advantages are described in non-technical terms by the use of 
analogies and examples. The book is concerned primarily with general purpose digital 
computers of which the most recent UNIVAC’s and IBM 700 series are the primary 
examples. Models from these two series are referred to frequently in connection with 
programming and illustrations of computer components. The discussion of compo- 
ments of computers—control units, storage devices, input-output devices, and arith- 
metic units—includes all types of devices that have been used in the past as well as 
those most often incorporated in currently constructed computers. Most of this sec- 
tion is spent describing a variety of internal storage devices and input-output devices 
and indicating the advantages and disadvantages of each device. 

Chapters III and IV are on binary arithmetic, machine instructions, programming, 
and computer logic. The topics, other than computer logic, are covered quickly in a 
total of forty pages of which only nineteen pages are devoted to flow charting, ma- 
chine instructions, and programming. The book is definitely not at all suitable for any 
but the most sketchy notion of how to program a problem on a computer. As a text on 
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data processing, it would have to be supplemented heavily in this section if program- 
ming is to be taught at all. The section on computer logic impressed the reviewer as 
one that could well be omitted. Here, the authors elaborate on computer circuitry‘ 
some historical notes on discoveries of the behavior of electrons, and some very ele- 
mentary descriptive physics. It is in this section that the jargon of the computer de- 
signer appears, for example, “gating,” “flip-flops,” etc. This is interesting general in- 
formation but not necessary to understanding the use of computers. 

The next two chapters are on computer applications listed sometimes by method of 
analysis, e.g., linear programming, and sometimes by area of application, e.g., apply- 
ing computers to commercial problems. Most of the illustrations are brief and indicate 
little more than that it is possible to program a particular problem on a computer. 

The remaining fifth of the material is devoted to problems involved in the installa- 
tion of a large electronic data processing system in a business firm. The authors spell 
out a number of difficult problems in converting an organization to computer opera- 
tion. There is a discussion of the necessary detailed plans and scheduling of steps in 
the conversion to computers, the training of personnel, the organization of the com- 
puter group, and the sociopsychological impact of introducing computers. 

An appendix of over fifty pages of technical information is supplied to discuss the 
performance characteristics of a number of commercial computers. Most of this space 
is devoted to the products of the two giants in the field. Also, a useful bibliography is 
provided at the end of the book which will be particularly useful to any reader who 
desires more than the most cursory knowledge of computers and their application. The 
brief treatment of topics is not in itself a basis for criticism, but is due to the decision 
to present such a broad sweep of the field of data processing in a book of modest pro- 
portions. Considering the authors’ aim to produce a very readable book on automatic 
computers for a very heterogeneous audience, they have not done badly. 


Handbook of Automation Computation and Control. Volume I (1958) and Volume II 
(1959). Eugene M. Grabbe, Simon Ramo, and Dean E. Wooldridge, Eds. New York: John 
Wiley and Sons, Inc. $17.50 each volume. 


MircHe tt O. Locks, University of California, Los Angeles 


TT set represents a milestone in the unification of knowledge. It is aimed primarily 
at automatic computation, data processing, and control engineering. However, be- 
cause high speed computers are used in so many different branches of science and 
technology, the material covered includes a wide range of applied measurement and 
engineering activities. The result is an encyclopedia-type compendium on techniques 
of the use and design of data processing systems. 

The Handbook is a planned 3-volume set of which the first two have appeared in 
print as of the date this review is being written. Volume I consists of 26 separate arti- 
cles on what is essentially “Applied Mathematics ” with most of the articles written by 
staff members of either the University of Michigan or the General Electric Company. 
Volume II consists of 31 chapters on Programming, Business Data Processing, and 
Computer Design and Engineering. Eleven of the authors of sections of Volume II are 
on the staff of either Thompson Ramo Wooldridge or its subsidiary, the Space Tech- 
nology Laboratories, and the majority of the remainder of the authors come from 
firms or educational institutions in the Southern California or Boston areas. A no- 
table achievement in this volume is the incorporation of a book-size 270-page chapter 
on “Programming and Coding” by John W. Carr III of the University of North 
Carolina. 

Since Volume I deals largely with numerical operations while Volume II has to do 
with applications and technology, the former is of more interest to most statisticians 
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than the latter. Similarly, Volume III which will be on “Systems and Components” 
should not be of interest to many statisticians unless they have overlapping areas of 
responsibility. Therefore the remainder of this discussion will deal only with Volume I. 

Of the articles in that volume, there are three that deal most directly with the sub- 
ject matter of Statistics. These are “Probability” by A. H. Copeland and “Statistics” 
by A. B. Clarke, both of the University of Michigan, and “Digital techniques in sta- 
tistical analysis of experiments” by Joseph M. Cameron of the National Bureau of 
Standards. All of these articles are competently written and contain much useful ma- 
terial. However, if information on those topics is desired, a statistician is likely to 
have available other sources of information. The real merit of the Handbook is that it 
provides a quick reference on other topics related to one’s own work, but in which his 
library may not be extensive, and a bibliography telling him where to look further. 

The other articles in Volume I which should be of the most interest to statisticians 
are the first 11 chapters on “General Mathematics,” the separate chapters on “Nu- 
merical Analysis” and “Information Theory,” and Dr. Arnoff’s 124-page opus on 
“Operations Research.” 

In view of the difficulties in getting different authors to write about related matters 
of substance without at least in part duplicating each other, the editors are to be com- 
mended for assembling a wide variety of articles of quality by different writers with- 
out too much overlapping in subject matter. This Handbook should be useful to re- 
search workers and teachers in the computation and data processing arts for a num- 
ber of years to come. 


Fractional Factorial Experiment Design for Factors at Three Levels. W. S. Connor and 
Marvin Zelen. Washington, D. C.: National Bureau of Standards, 1959. Pp. v, 37. $.30. 
Paper. 


FRANKLIN A. GrayBILL, Oklahoma State University 


HIS is a monograph published by the United States Department of Commerce in 
the National Bureau of Standards Applied Mathematics Series—54. 

This monograph is primarily a catalogue of fractional factorial arrangements for 
3” factorial systems. The arrangements which are catalogued are 1/3, 1/9, 1/27, 1/81, 
and 1/343 replications for various combinations of factors which range from 4 to 10. 
There are also arrangements for grouping into various sizes of blocks. The fractional 
designs which are listed in this monograph are such that no main effects are aliased 
with other main effects or two factor interactions, and such that as few two factor 
interactions as possible are aliased with other two factor interactions. Each design is 
accompanied by a fundamental identity which can be used to show what effects are 
aliased. The first 10 pages is an exposition of the theory of the construction of the de- 
signs, explanation of the tables, and the analysis of variance. The remainder is the 
catalogue of the various fractional factorial designs. 

This monograph seems to me to serve a useful purpose. For anyone interested in 3” 
fractional factorial arrangements this monograph contains many of the patterns. 
However, even though the theory is explained, it is so condensed that it won’t be too 
useful to anyone who is not already well acquainted with the theory of fractional 
factorial designs. I think that the authors never intended this to be an elementary 
treatment of the theory of fractional factorial designs, but included a resume of the 
theory for completeness and to better explain the actual catalogue of arrangements. 

In general the explanations seem to be clear and well written. However, there is an 
unusual statement on page 7. It is “This model can be written as 


y(t1, Z2, Tx, * + +) = n(ti, Le, Ts, +> ) + Bz + €(t1, Z2, Zs, °*- ) 


where y(xi, 22, Zs, °* * ) represents the actual observation of the factorial combina- 
tion, (21, 22, Zs, °° *) represents the true value of the observation - - -.” I don’t 
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understand why the authors chose the words “true value.” It seems to me that it 
would convey more meaning if (x1, %2, 2s,  - +) referred to the “treatment effect” or 
something similar. 


Expectations, Uncertainty and Business Behaviour. Mary Jean Bowman, Ed. New York: 
A Conference Held at the Carnegie Institute of Technology, October 27-29, 1955, under 
the Auspices of the Committee on Business Enterprise Research of the Social Science 
Research Council, 1958. Pp. vii, 202 $2.00 (paper). 


Dovetas G. Hartuie, University of Toronto 


CONFERENCE on research on expectations, uncertainty and business behavior was 

4% held at the Carnegie Institute in October, 1955, under the auspices of the Com- 
mittee on Business Enterprise Research of the Social Science Research Council. This 
paperbacked volume contains the revised papers presented at the conference, some of 
the discussion which followed the papers, and a most useful Introduction written by 
Mary Jean Bowman who, in the words of the foreword, “developed the plans for the 
conference .. . obtained the participation of the authors . . . and obtained their col- 
laboration in revising the papers... .” 

As Bowman points out, it is impossible to briefly summarize the contents of all four- 
teen papers, two notes and the comments of a number of discussants which this 
volume contains, much less evaluate their significance. About all a reviewer can use- 
fully do is to list the principal papers and give some indication of their contents. 

The volume is made up of four parts. The first part consists of two papers and a 
note on the nature and structure of expectations. The first paper, by Nicholas Geor- 
gescu-Roegen, provides an analysis of the nature of the major theories of expecta- 
tions, including the Classical, Frequentist, Carnapian and Subjectivist schools. It also 
includes an evaluation of their relationships to each other and to the development of 
a theory of rational decision making. The author concludes that it is impossible to de- 
rive an objective way of arriving at decisions which involve true uncertainty; and he 
apparently believes that ultimately such decisions must be based on “good judgment.” 
The second paper, by G. L. 8. Shackle, provides a brief restatement of his well-known 
theory of expectations and an attempt to develop an explanation of the advantages 
and disadvantages of liquidity in terms of this theory. The section concludes with 
a brief but interesting Note by Ward Edwards which contrasts Shackle’s theory of 
surprise with a decision model based on non-additive subjective probabilities which 
Edwards and Donald Davidson are developing. 

Part II consists of five papers on the “Psychosocial Approaches to the Study of 
Expectations and Business Behaviour.” The first paper, by Herbert A. Simon, is en- 
titled, “The Role of Expectation in an Adoptive or Behavior Model.” This paper 
seems to me an important contribution to the field, for it mirrors all of the breadth of 
understanding, imagination and incisive analysis which we have come to expect from 
this author. Simon first distinguishes between what he terms programmed and non- 
programmed decisions. The former constitute a class of recurring decisions which are 
dealt with by a well-established, systematic procedure; the latter involve a choice 
selection in which there is “much stirring about, deliberation, discussion (and) often 
vacillation.” Programmed decisions are what economists usually consider “rational 
decisions” and they are the types of decisions to which modern decision theories, of 
the Savage variety, apply. Non-programmed decisions, on the other hand, are what 
economists might term the process of innovation; for “the alternatives of choice are 
not given in advance”; the “means-ends connection between choice and consequences 
are imperfects known”; and a “frame of reference must be chosen before the choice 
can be brought within the limits of human computation.” 

Following a brief discussion of how social institutions stabilize expectations and 
widen the sphere of programmed decisions, Simon turns to a brief analysis of the 
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process of innovation or unprogrammed decision making. It is in this section of the 
paper that Simon’s insights are most decisive, for by observing the remarkable simi- 
larity between the economists’ concept of opportunity cost, and the psychologists’ 
concept of aspiration level he is able to develop an explanation of the innovation proc- 
ess which may prove to be of the greatest significance in bringing these two disciplines 
together in an analysis of this important but difficult question. 

The Simon paper is followed by a paper by George Katona, which discusses the 
role of expectation as part of ex ante data, the psychology of decision making, the 
origin of attitudes and the measurement of expectations. Two papers by psycholo- 
gists G. Patrick Meredith and William E, Henry and a paper by a sociologist, Albert 
Lauterbach, complete this section. While the idiom of these contributions will prob- 
ably be foreign to most of the readers of this journal, the framework provided by the 
Simon contribution makes them more readily understandable and pertinent. 

The two final parts of the volume include papers with an applied orientation. Most 
of them attempt to devise or test models involving expectations, forecasts or plans. 
Part III is basically devoted to the analysis of expectations and time horizons in par- 
ticular industries; while Part IV includes papers which are concerned with the fore- 
casts or plans made by the establishments in the economy as they relate to changes in 
the aggregate level of economic activity. 

Ruth Mack has presented the framework of a raw material inventory model involv- 
ing expectations of future prices applicable to firms which are competitors in the fac- 
tor market. While this model is developed with particular reference to the boot and 
shoe industry, with which the author is so familiar, it probably has wider applicabil- 
ity. Of the three attempts to relate the concept of expectations to empirical work at 
the industry level, this paper appears to be the most successful. Merton J. Peck in his 
paper on inventory behavior in the aluminum industry is at pains to show the rele- 
vence of marginal analysis to an understanding of business behavior under uncer- 
tainty; but it seems more successful as a description of inventory policy in this indus- 
try than as an illustration of the virtues of the marginal analysis. Hans Brem’s paper 
deals with differences in the timing of price and non-price competition in the auto- 
mobile industry. While an interesting analysis of oligopolistic behavior, it does not 
seem to be particularly germane to the subject matter of the conference. 

On the whole the papers included in Part IV have perhaps less novelty than the 
other papers included in the volume. This judgment in part may be the result of the 
greater familiarity of the reviewer with the problems with which they are concerned, 
and in part because of prior publication of their results. The first paper, by Franco 
Modigliani and Kalman J. Cohen, is entitled: “The Significance and Uses of Ex-Ante 
Data.”! This paper lays the foundation of the “realization function” approach to the 
use of ex ante data which Modigliani has since applied in the article written with 
Weingartner (which appeared in the February, 1958, issue of the Quarterly Journal of 
Economics). Because the authors are in the process of revising their basic work in this 
field readers of the article under review should bear in mind that it may no longer rep- 
resent their current views, at least in detail. 

The usefulness of their approach is attested by the success of Robert Eisner’s paper 
on “Expectations, Plans and Capital Expenditures: A Synthesis of Ex Post and of Ex 
Post and Ex Ante Data,” which is, in part, an application of the anticipations func- 
tion—realization function concepts developed by Modigliani and Cohen. With the 
unique data he has been able to examine, which includes forecast and actual values of 





1 In the first footnote to this article the authors state that it provides a summary view of a “full and more 
rigorous exposition given in a paper entitled “The Role of Anticipations and Plans in the Economy of the Firm 
and Their Uses in Economic Analysis and Forecasting’ .” The latter paper is now (February, 1960) being extensively 
revised and has not yet been published—the cited reference notwithstanding. 
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a number of variables, including capital expenditures, sales and profits, for the same 
firms over a number of years, Eisner has been able to test, in an admittedly tentative 
fashion, the Hicks-Harrod formulation of the acceleration principle. This has led 
Eisner to suggest some interesting modifications involving distributed lags. Econo- 
mists interested in this field will applaud Eisner’s promise to intensify his research in 
this direction in the future. 

Irwin Friend’s paper consists of a summary evaluation of the investment intentions 
surveys; and, to a lesser extent, forecast surveys of other variables. The passage of 
time has unfortunately dated this evaluation and it should now be revised to take 
into account several papers which were presented at the Universities-National Bureau 
Conference held at Princeton, November 8-9, 1957. These papers will soon be pub- 
lished. 

The final paper in the volume by William Durcovich provides a neat “Evaluation 
of Some Naive Expectation Models for Agricultural Yields and Prices.” Basically the 
author attempts to assess how well the farmer would be able to forecast these varia- 
bles using extrapolative devices of increasing degrees of “sophistication,” including 
techniques which would take into account information published in agricultural out- 
look reports. Although this paper undoubtedly provides some interesting information, 
it does not appear to be closely related either to the other papers or to the principal 
purpose of the volume. 

The wide disparities in approach among the papers included within the various 
Parts; and the immense differences in subject matter between the Parts themselves, 
provide a measure of our current understanding of the relationships among expecta- 
tions, uncertainty and business behavior. Much of this complex area is still largely 
undeveloped, yet this conference undoubtedly elicited some important contributions. 
Moreover, drawing them together serves a useful function; for it becomes more feasi- 


ble to assess the gaps and inconsistencies in our knowledge and helps to indicate re- 
search avenues which might usefully be pursued. While this purpose of the volume 
ultimately can be served only by a careful study of the papers themselves, the per- 
ceptive Introduction provided by Mary Jean Bowman will prove of great assistance 
to those who wish to evaluate the weaknesses and highlight the strength of our present 
understanding. 


The Federal Funds Market: A Study by a Federal Reserve System Committee. Board 
of Governors of Federal Reserve System, 1959. 


Evuaene C. Zorn, Jr., Dallas, Texas 


HIs work of a Federal Reserve System Committee contains data and other infor- 

mation that are obviously valuable for the formulation and execution of monetary 
policy. The committee, in addition, has performed a genuine service for the banking 
community, money market observers and operators, and students of money and 
banking. Facts concerning the Federal funds market have been scattered and incom- 
plete. Their assembly into a well-organized and highly informative booklet was long 
overdue. 

The progressive development of flexible monetary policy over the past decade has 
complicated the process of money market analysis. New instruments and techniques 
have been developed and others have been taken down from the shelves and dusted 
off. Concepts and traditions that once seemed permanent have been modified or dis- 
appeared. Flexible monetary policy has not been thrust upon us suddenly. We have 
had to grow up to it. Part of this growing-up process has been the gaining of knowl- 
edge and experience regarding the market process and the forces of supply and de- 
mand that shape such a process. 
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That is the story of the Federal funds market. In recent years it has become one of 
the most interesting and significant elements in the whole complex of money market 
analysis and policy administration. Not too long ago detailed knowledge of Federal 
funds operations was confined to a relatively few specialists at the trading desks of 
dealers, the money desks of banks, and within the Federal Reserve System. But the 
increasing sensitivity of the money market under conditions of restraint is making it 
more essential for managers of short-term investment funds to understand the intri- 
eacies of all facets of their field. The System Committee has given us a good set of 
working tools to add to the fund of knowledge for this purpose. 

Apart from the descriptive information on Federal funds operations, the Fed has 
plowed fertile ground in providing data as of November 1956 on the size of the Fed- 
eral funds market, the types of transactions, their geographic distribution, and rates. 

Three years have elapsed since the date of its survey. Deposits are turning over 
more rapidly, the discount and market rates are significantly higher, and Treasury 
bills—a money market instrument that competes with Federal funds—have increased 
substantially in amount, and have come to rest in the hands of large non-bank in- 
vestors who are gaining a sophistication regarding money market analysis. Develop- 
ments such as these give rise to new relationships within the money market. To help 
interpret their significance, a continuous flow of information is highly desirable. The 
broader the understanding, the more perfect and smooth the market process can be in 
reflecting basic forces of supply and demand. It is hoped, therefore, that the Federal 
Reserve will not allow too long a period to elapse before making another survey giving 
us a profile of the Federal funds market. 


Comparisons of the United States and Soviet Economies. Joint Economic Committee. 
Congress of the United States. 3 parts (616 pages; $1.70); and a volume of Hearings (292 
pages; 70 cents). September 1959-January 1960. Government Printing Office, Washing- 
ton, D. C. 


Leon M. Herman, Library of Congress 


HIS series of publications grew out of a skillful, well-timed enlistment by the Joint 

Economic Committee of the expertise of a group of working economists for the 
task of summing up their latest findings on the performance of the Soviet economy 
and discussing their implications for national policy-making in the United States. 

The first three volumes consist of papers written by the individual economists in 
advance of the hearings, by way of communicating with the other participants in the 
symposium. These papers are grouped into sets of panels by broad subject-matter: 
industry, agriculture, national income, etc. The fourth volume carries the summary 
statements made by the panelists during their appearance as witnesses before the 
Joint Economic Committee and the discussions that followed. 

While the orderly division of the papers by panel, or by volume, is at times upset by 
reason of late arrival of papers, the contributors to these volumes fall into three dis- 
tinct groups. The largest group is made up of analysts working in the field of Soviet 
economics. Between them, they present a documented, comprehensive review of re- 
cent developments in the Soviet economy, offering wherever possible a framework 
within which Soviet performance can be realistically compared to that of the United 
States. This group accounts for eight out of the total of ten panels taking part in the 
symposium. 

The ninth panel consists of representatives of non-governmental policy-making 
organizations (American Farm Bureau, National Education Association, Committee 
for Economic Development, etc.), who discuss the challenge represented by the ris- 
ing economic capabilities of the Soviet Union in the spheres of the economy with 
which they are familiar. 
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The final panel, by far the most interesting for the general reader, consists of a 
group of “elder statesmen,” seasoned observers of the Soviet world-wide drive for 
power, who summarize the salient facts placed on the record by the sector specialists 
and offer their evaluation of where we stand in the economic race with Moscow. 
Most of the attention of this group is, however, devoted to the important unfinished 
business that lies ahead if we are to avoid seeing our power eroded and the economic 
viability of the free world sacrificed by the failure to use in full the resources at our 
disposal. 

Above all, this remarkable symposium has the great advantage of timeliness. The 
vast and varied body of facts it has assembled, having been written down by the work- 
ing analysts between May and September in 1959, are the fresh data on the basis of 
which public thinking on the subject will be governed for the months ahead. The 
business of commercial publishing being what it is, it is a fair guess that some of these 
facts will probably turn up in the textbooks some two or three years hence. 

The ideas on what are the alternatives for United States policy, domestic and 
foreign, stimulated by the findings of these investigations among the four members of 
the final panel, are similarly fresh and operative. A good representation of these ideas 
is mirrored in the highly perceptive paper of W. W. Rostow (part III), who discusses 
with the extra sense of perception of the economic historian the serious adjustments 
we must make, in our policy-making in general and in our allocation policy in particu- 
lar,for coping with the type of long-term, unspectacular threat of erosion to our 
power and to the survival of democratic political institutions represented by the 
planned, inexorable expansion of the power of the Soviet Union. 


An Econometric Model of Postwar State Industrial Development. W. R. Thompson and 
J. M. Mattila. Detroit: Wayne State University Press, 1959. Pp. vii, 116. $3.00. 


Wituram A. Cromarty, Connell and Company 


4 om authors of this book have tackled two difficult problems, (1) the theoretical 
problem of constructing an economic model which adequately explains industrial 
development, and (2) the equally difficult task of selecting estimators which give 
quantitative estimates of the economic relationships. Unfortunately, neither problem 
is successfully completed. 

The authors decide upon the “estimated average annual change in the number of 
workers employed, by states” as the variable to be explained, measured for the years 
1947-1954 inclusive. 

The independent or explanatory variables are taken to be specific indicators which 
fall in the broad categories of, 

(a) growth of the local market, 

(b) factors reflecting the degree of industrialization, 

(c) factors reflecting the level of state and local taxes, 

(d) factors reflecting characteristics of the labor market, 

(e) factors expressing the educational level of the population, and 

(f) patents granted, expenditures, and employment in 1947 for specific industries. 

The selection of the dependent variable as an indicator of industrial development 
seems to rest on the fact that data were available. Such a criterion is weak. Employ- 
ment, as they point out, is only one variable which measures industrial development 
and it is questionable if it is a good overall indicator. For instance, for the period 
1947-54, durable goods sales for the United States rose from 6.7 to 11.2 billion dollars, 
almost a 70% increase; or in physical terms the index of industrial production for dur- 
able goods rose 38 points from 1947 to 1954. However, the number of workers in dur- 
able goods manufacturing showed a four per cent change, going from 15,321,000 to 
15,995,000. A more significant example, although not included in the text, is the case 
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of agriculture. Farm employment dropped from 10.4 million in 1947 to 8.6 in 1954. At 
the same time farm output on a physical production index basis jumped from 95 in 
1947 to 109 in 1954. 

Of an equally serious nature is the violation of the assumption that in least squares 
regression analyses, which the authors choose to use, the dependent variable is made 
up of observable data. Yet the footnote on page 89 indicates that some of the data 
were not available and had to be estimated. 

Many questions could also be raised with regard to the selection of the independent 
variables. Personal income, for what is almost a current period, i.e. 1945-8, does not 
appear to be a good determinant of the number of workers employed. The reverse 
relationship would appear more plausible, which suggests that the treatment of this 
problem on the basis of single equation models may be too simplified for accurate re- 
sults. 

The authors appear to have trouble explaining the positive relationship between 
average hourly earnings and the number of workers employed. In reality it may be a 
very realistic relationship, except that the equation represents a supply relationship. 
This might be considered as an omission of identification procedures. 

The estimating techniques are simple—correlation coefficients and linear regression 
analysis. The impression is left on page 35 that the technique itself selects the particu- 
lar variables to be inserted into each equation and in some way provide sufficient 
estimates. The criterion of selection is not stated, however, and if it was on the basis of 
the values of the simple correlation coefficients the criterion is again very weak. 

In all fairness to the authors it should be pointed out that many of the inadequacies 
of the study are apparent to them and they have pointed out some of these in Chap- 
ter V. Their interpretation of the regression analysis results between industry classifi- 
cations is commendable and probably the most useful section of the book, and a good 
summary of their findings is presented in Chapter VI. The book will probably be use- 
ful to persons involved in state planning in spite of the statistical and economic short- 
comings pointed out previously. 


Operations Research: Methods and Problems. Maurice Sasieni, Arthur Yaspan, and 
Lawrence Friedman. New York: John Wiley & Sons, Inc., 1959. Pp. xi, 316. $8.50. 


Harvey M. Waener, Stanford University and The RAND Corporation 


HE instructor of an elementary course in operations research (OR) expects ulti- 

mately that his students achieve the level of vaguely understanding the A, B, C’s 
of probability, sampling, inventory analysis (JA), replacement policy (RP), waiting 
line theory (WL, or more briefly Q), game theory (GT), linear programming (LP), se- 
quencing of operations, and dynamic programming (DP). The textbook by M. 
Sasieni, A. Yaspan, and L. Friedman (whose initials we rearrange as a mnemonic 
abbreviation to MYLAF'S) can be recommended for this purpose, as these men pro- 
vide a chapter on each of the subjects, and for the most part spell out the material at 
the aforementioned level. 

The readers are assumed to know the calculus, hopefully to have mastered “a 
modicum of statistics . . . equivalent to . . . a one semester course,”* to be graduates 
or advanced undergraduates, and to wish to “develop skill in formulating and solving 
mathematical models” within one semester. 

In each section the authors, who developed the course material at Case Institute of 
Technology, admirably furnish a discussion of general theory and certain computa- 
tional techniques, several completely solved problems demonstrating the techniques, 
and student’s exercises; further each chapter is “complete in itself as a teaching unit.” 





* The students’ memories are refreshed by the chapters on probability and sampling. 
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Evaluating the book in absolute terms, this reviewer, who has used the text in a 
class exactly meeting the prerequisites stated, feels that by and large the material is 
too easy and the scope too limited. Needless to say, the students were not in perfect 
agreement with this conclusion. In relative terms, the book compares most favorably 
with other competing volumes. 

A sample of a few specific comments is in order. The LP chapter is about the best 
insofar as breadth of coverage is concerned. Little tie-in is made between the chapters 
on JA, RP, and DP. In the Q chapter, Erlang distributions are cited; in the probabil- 
ity chapter, the gamma distribution is described; but no connection between the two 
is mentioned. The authors deserve a mixture of kudos and brickbats for their GT sec- 
tion: the examples and problems are challenging. But my students failed to fathom 
from the text material what exactly a maximin strategy was in simple two-person 
zero-sum games (e.g., the class even had difficulty in verifying such strategies when 
they were stated numerically for particular games), and were misled as to how to 
compute algebraically such strategies. The GT chapter precedes the LP chapter, so 
the authors do not mention LP when discussing GT’; this situation is not remedied in 
the LP chapter. Although the authors in Chapter 3 argue for the usefulness of sta- 
tistical methods in OR, they noticeably sidestep demonstrating the relevance of such 
techniques at the crucial points in subsequent chapters. In short, many of the de- 
ficiencies stem from the authors’ well kept promise of making each chapter complete 
in itself as a teaching unit. 

Of course the imperfections in textbooks usually serve their purpose in providing 
material for lectures, and in recommending the book, this reviewer notes the ease he 
discovered in augmenting MYLAFS in the classroom. 


Yearbook of Fishery Statistics, Vol. IX, 1958. United Nations, Food and Agriculture 
Organization. New York: Columbia University Press, 1960. Pp. xxix, 89. $4.50. 
HIS volume continues the publication of data on catch and disposition of fish and 
fishery products throughout the world. Estimates for countries and species are 
presented. Results of a thorough revision of statistics on products by Japanese author- 
ities are incorporated. Designations of material in tables are in English and French, 
textual material is also in Spanish. C. H. 


Consumer Expectations, Plans, and Purchases: A Progress Report. F. Thomas Juster. 
National Bureau of Economic Research Occasional Paper 70. New York, National 
Bureau of Economic Research, Inc., 1959. Pp. xi, 171. $2.50. Paper. 


Morris Cowen, National Industrial Conference Board 


_— past fifteen years have been marked by a considerable interest in expectational 
research. The underlying thesis is a simple one. Each of the major economic sectors 
plans future activity. If these plans can be tapped and properly interpreted, then 
greater insights will be gained into subsequent behavior. Businessmen plan ahead for 
capital goods spending through capital budgets and capital appropriations, and 
governments plan ahead through annual budgets. Both business and government 
plans are generally formalized in written documents. Consumers also plan, but 
mostly in an informal manner, and their plans must be obtained either through per- 
sonal interviews or mail surveys. 

The pioneering work of George Katona and the University of Michigan Survey Re- 
search Center in developing consumer surveys focussed on consumer attitudes and 
buying plans is well known. For years, they were the only group continuing to make 
and publish studies on consumer expectations’ research. Nevertheless, there were 
still some doubts. The 1955 Report of the Consultant Committee on Consumer Survey 
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Statistics organized by the Federal Reserve Board (one of a series of five distinguished 
reports too long neglected by the profession) paid homage to these pioneers, but still 
recommended that additional investigators in other institutions study consumer be- 
havior through expectatioual data. 

Mr. Juster’s report represents a major methodological advance precisely of the type 
which the Federal Reserve Committee wanted. It is based upon mail survey data 
which are completely independent of the Michigan Survey Research Center. Sub- 
scribers to Consumer Reports published by Consumers Union of the United States, a 
nonprofit testing and rating organization, reply evidently in very large numbers to 
questionnaires. It is admitted, of course, that such a survey is not representative of 
all the nation’s households; it is not even a random sample of the magazine’s own 
subscribers. The results that emanate from this nonrandom survey may still prove to 
be of great methodological significance. They may not prove to be useful for business 
forecasting, though Juster presents evidence that they have been in the past, but they 
may well provide some underpinaings for a better theory and understanding of con- 
sumer behavior. 

There is some merit to Juster’s implicit foundation of his research: if you want to 
study consumer buying plans in depth, then ask the people who are most likely to 
plan. Subscribers to the activities of Consumers Union probably represent the most 
planning-conscious group of consumers in the country. What is missing most in this 
approach, of course, are the consumers who buy on impulse. The impulse buyers, 
those who purchase without a plan, can better be investigated through a survey of the 
entire population. Some question also remains about the representativeness of the CU 
planners. They may prove to be a different type of planner not representative of all 
consumers who plan. That is why in the final analysis the Juster results must be con- 
firmed through random probability surveys. 

The broad results of this progress report represent additional evidence on the use- 
fulness of consumer buying plans and attitudes as a predictive tool. For example, it 
was found that from 1949 through 1957, the year-to-year changes in Consumers Union 
subscribers’ buying plans for automobiles and major appliances foreshadowed actual 
purchases for the country as a whole. Taking income into account, fully 95% of the 
annual variations in these purchases were statistically explained by two variables— 
changes in disposable income and buying plans. Clearly, further efforts and additional 
research in buying plans are warranted. 

However, the bulk of Juster’s present research report centered on 25,000 re-inter- 
views in April, 1958, of respondents to an earlier survey in October, 1957. This repre- 
sented the largest re-interview samples ever taken which were concerned with con- 
sumer buying plans. Only with samples of this size can all the interrelationships be 
studied, holding various factors constant while other variables are analyzed. 

One of the more important findings of the study relates to the time horizon of the 
planning period. By breaking down his basic 25,000 re-interviews for April 1958 into 
five subsamples, Juster is able to introduce experimentation into the field of consumer 
expectations. He found, as has the writer working with other materials, that “the 
level of buying plans reported in a given survey is sensitive to variations in the time 
horizon covered by the question asked.” Further, buying plans for the next six months 
are “more apt to be meaningful indicators of future actions by households and are 
probably more useful as ingredients in a forecasting model.” Hitherto, most consumer 
buying plan surveys have used a twelve-months’ planning horizon. The Juster evi- 
dence thus provides further support for the more recent trend toward framing con- 
sumer buying plan questions with a shorter time span. Both the Conference Board’s 
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continuing survey of consumer buying plans and the Census Bureau’s still unpub- 
lished surveys in this field have been using the shorter planning period. 

As far as expectational variables are concerned, perhaps the most provocative 
finding was the significance of household expectations about personal financial pros- 
pects over a five-year future period. These were among the most powerful influences 
affecting both plans and purchases. Since presumably such expectations are less 
likely to fluctuate over the short-run, it would appear that some magic formula has 
emerged which can identify the more accurate planners. It is recognized that such 
identification may not be useful for aggregate predictions, since these expectations 
might be stable over time. Nonetheless, if this result could be substantiated it would 
represent a major contribution toward a better understanding of consumer behavior. 
This example illustrates the uncertainty about many of the important findings from 
this study and reflects the lingering doubts about the general applicability of results 
from nonrandom surveys. 

Since most of the report is concerned with relationships between surveys of the 
same consumer units taken in October, 1957 and April, 1958, the whole question of 
seasonality in buying plans and attitudes must be raised. In The Conference Board’s 
survey of consumer buying plans, a random, telephone survey of 5,000 households 
taken each month, it has been found that buying plans and some attitudes are marked 
by sizeable seasonal fluctuations. Furthermore, the timing of the Consumers Union 
surveys was intertwined with the 1957-8 business recession. Both cyclical and sea- 
sonal factors affect consumer buying plans and behavior, perhaps even the carrying 
out of the planned purchases. Any results that emerge from two observations in time, 
particularly those at cyclical turning points, must await further confirmation from 
additional research. 

Professor Albert Hart, under the sponsorship of the Ford Foundation, is continuing 
the effort forward in time, using the CU survey panel, and his further results are 
eagerly awaited. Still, all investigators in the field of consumer behavior and particu- 
larly those specializing in consumer buying plans and attitudes will, I am sure, want 
to consult the many fruitful hypotheses and conclusions presented in this careful 
study. 


The Population of the United States. Donald J. Bogue. Glencoe, Illinois: The Free Press, 
1959. Pp. xix, 873. $17.50. 


Watter B. Watson, University of Washington 


HIS ambitious volume is the fourteenth in a series on population distribution by 

Bogue and others of Scripps Foundation and the Population Research and Train- 
ing Center of the University of Chicago. The author maintains the high standards set 
by the earlier monographs in a work of truly staggering proportions. Twenty-six 
chapters and 701 double column over-sized pages include 384 tables and 93 figures, 
graphs, and charts. In addition the text contains numerous small unnumbered sum- 
mary tables. There are also a 68-page statistical appendix, a 69-page tabular supple- 
ment to the chapter on occupational composition giving numerous demographic cor- 
relates of more than 400 specific occupations, and a similar 21-page supplement to the 
industrial composition chapter for 77 industries. 

The author opens with an account of the size and growth of the population and 
closes with a discussion of characteristics and implications of the future population. 
In between are four excellent chapters on distribution (urban-rural, metropolitan- 
nonmetropolitan, states and regions, and the system of economic areas), one each on 
mortality and fertility (the latter by Wilson Grabill), two on migration (international 
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and internal), five on economic characteristics (labor force, occupation, industry, un- 
employment, and income), ten on other characteristics (age, sex, race-nativity, 
marriage, households and families, housing, education, institutional population, 
health, and religion), and one chapter on Alaska and Hawaii. The chapters are 
minimally interrelated and hang together only in terms of the content of the census 
and other available data rather than in terms of an underlying theory or point of 
view; this comment, however, is more a commentary on the character and state of ad- 
vancement of the field than a criticism of the author. 

The use of age as a control throughout and sections on metropolitan areas in sev- 
eral of the characteristics chapters are particularly well done. Grabill’s chapter on fer- 
tility is good but contains an unfortunate and unnecessary use of the value laden 
term “welcome” in a reference to the postwar baby boom. (p. 302) International com- 
parisons in several chapters provide a helpful comparative setting within which to 
view the United States population. 

“The book is intended primarily for reference use, and has been written and indexed 
with this in mind.” (p. tv) The excellent tables and illustrations and the lucid text are 
a major contribution in this regard. Many of the data are taken from standard census 
sources including various Special Reports and from the vital statistics publications, 
but the book is liberally and effectively supplemented and brought up-to-date by the 
inclusion of data from the Current Population Survey. The latter help to achieve the 
formidable task of “analyzing and interpreting the results of the 1960 census before it 
was taken.” (p. tit) The reference value of the volume is further enhanced by the in- 
clusion of data from the National Health Survey in a chapter on illness, data from the 
National Opinion Research Center in the chapter on religion (particularly in view of 
the absence of information on religion in the 1960 census), data from the National 
Housing Inventory in a housing chapter, and some unpublished data and special 
tabulations by the author. Introductions to the various chapters are adequate; the 
chapter summaries might well be expanded somewhat. 

Frequent small typographic and editorial errors in text and tables detract from the 
otherwise excellent reference value of the book as does the absence of a bibliography. 
Small errors, such as failure to indicate that the numbers are in thousands in some of 
the tables, and no bibliography are unlikely to trouble professional demographers, 
but they may bother students, laymen, and even scientist users from other fields. A 
highly selected bibliography would have taken only a few pages and would have been 
an asset. More careful editing could have eliminated confusion arising from state- 
ments such as the following a few lines apart: “The number of foreign-born persons is 
still quite large .. .” and “The fact that the population no longer contains large num- 
bers of foreign-born persons . . .” (p. 348). 

Two pleas by Bogue deserve mention. In the chapter on distribution by economic 
areas and again in the internal migration chapter he advocates greater use of the 
socially and economically defined system of state economic areas as units for demo- 
graphic analysis in lieu of the politically defined system of states and census regions. 
The reviewer agrees that for many purposes the economic areas are clearly superior. 
Bogue also calls for more attention to detailed occupational and industrial categories 
instead of the traditional eleven and twelve category classifications. In view of the 
importance of occupation as a social and economic variable, the reviewer agrees and 
hopes that attention will be directed to the lengthy tabular supplements to the occu- 
pation and industry chapters noted above. 

The few criticisms mentioned are minor relative to the breadth of coverage, the im- 
pressive compilation of data, and the over-all high quality of the book. The reviewer 
hopes that the price will not prevent demographers, students, and laymen in applied 
fields from making extensive use of this outstanding work. 
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Catalytic Models in Epidemiology. Hugo Muench. Cambridge, Mass.: Harvard University 
Press, 1959. Pp. xiv, 110. $4.50. 
Maraaret Martin, Johns Hopkins University 


HE author develops simple mathematical models for expressing two types of rela- 

tionship in the field of epidemiology. One relationship is that of the proportion by 
age of individuals showing evidence (by medical history or laboratory test, or both) of 
having had, at some time in the past, effective contact with a given infectious agent. 
An underlying assumption of these models is the existence of a “force of infection” 
(number of effective contacts per person per year) which remains constant over time 
and for all ages within the range considered, or which is a specified function of age. 
The “force of infection” then becomes a meaningful constant in the equation. The 
models apply to the situation where one is taking a broad view of the occurrence of in- 
fection in moderately large populations over fairly long periods of time, ignoring 
fluctuations occurring over short intervals and between age groups. Much of the 
author’s excellent discussion of the advantages and limitations of this simplified ap- 
proach applies to model-building in general. 

The other relationship, treated in a single chapter, is that of the survivorship ratio 
1,/l, for comparing survival of individuals with a chronic condition to that of a life- 
table population of similar composition, over a time period after the date of diagnosis, 
say. Here an “increased force of mortality” analogous to the “force of infection” above 
is postulated. 

The equations obtained are those of the simple catalytic (exponential) curve, with 
fixed and with movable asymptote, together with variations which include the cases 
of truncated data and of a superimposed epidemic of short duration, the reversible 
catalytic, and the two-stage catalytic. 

The method of fitting is that of moments, using the zeroth and first moments of the 
distribution by age, or time, with frequencies proportional to the per cent positive in 
each age group. Rapid graphic estimates of the constants for all the models are ob- 
tained by the ingenious use of three simple nomographa. 

The assumptions underlying each of the models are clearly stated, and the fact that 
some of them may be rather unrealistic is discussed. In particular, the data on yaws 
and on histoplasmin sensitivity do not seem to satisfy very well the assumptions of 
the two-stage catalytic model, which they are used to illustrate. The usefulness of 
studying departures from the model for clues to the phenomenon under study is 
pointed out and illustrated. 

More discussion could well have been given to the limitations of the method of 
fitting used and to the possibility of using alternative methods even though presenta- 
tion of the latter is not within the scope of the book. The beginner may not recognize, 
for example, that percentages in age groups where the number examined is small may 
have undue influence on the constants of the equation. For situations where more pre- 
cise estimates are desired, together with standard errors, one would feel justified in 
recommending the method of least squares. 

The following details might be mentioned: 

On page 33, it may not be immediately apparent to the student that k equals the 
ratios of observed to calculated values of 2’A as well as of 2A. 

On page 37, the right hand form of equation 4.13 would seem a more direct ap- 
proach than the form preceding it. 

On page 74, last sentence, it would seem that the word rates rather than ratios is in- 
tended. 

On pages 91 and 93, the expression “confidence limits” is used incorrectly. 

On page 93, second paragraph, the chief disadvantage of wide age grouping is not 
the one stated. 
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A few of the statements in Chapter 8 (top of p. 91 and middle of p. 94) do not seem 
to be in line with the best current and, it is hoped, future practice in sampling. 

In summary, the reviewer feels that this book gives the student an interesting 
and understandable introduction to model-building which is simple enough to en- 
courage him to try his own hand at the game. She has reservations concerning some of 
the assumptions underlying the models, as does the author, and about the method of 
fitting. 


Regression Analysis. E. J. Williams. New York: John Wiley and Sons, 1959. Pp. x, 214. 
Jack Kierer, Cornell University 


u1s book offers the practical statistician a compendium of the classical techniques 
p eomareel with regression analysis, The author has included references from re- 
cent literature, but the point of view is that which existed before the advent of sta- 
tistical decision theory. The book is full of good numerical examples which also serve 
to illustrate the way in which departures from the usual simple models arise in prac- 
tice. In particular, there are good examples of such topics as estimation from non- 
linear equations, estimation of zeros of a fitted curve, etc. A great many of the ex- 
amples are taken from the field of forestry. 

The contents are: Introduction, Linear Regression, Multiple and Polynomial Re- 
gression, Regression Equations Requiring Iterative Calculation, Choice Among Re- 
gression Formulas, Estimation from the Regression Equation, The Analysis of Co- 
variance, The Treatment of Heterogeneous Data, Simultaneous Regression Equa- 
tions, Discriminant Functions, Functional Relations, References, Index. 


The New Inflation. Willard L. Thorp and Richard E. Quandt. New York: McGraw-Hill, 
1959. Pp. xi, 233. $5.00. 


KENNETH E. Boutptne, University College of The West Indies 


H1s book evidently emerged as a result of a conference of twenty-three economists 
Va the Merrill Center for Economics in July 1958. It is not, however, intended as a 
summary of the discussions, but is the sole responsibility of the authors. Its attention 
is focussed on the experience of the postwar years. The title seems to imply that this 
“new” inflation is somehow different from the old—there are no very startling conclu- 
sions, however, and the general style of the work is somewhat textbookish. More at- 
tention is paid to structural pressures, and what has elsewhere been called “cost- 
push” inflation than would have been the case twenty-five years ago. It is perhaps a 
tribute to the relative stability of economics that there are so few surprises in this 
authoritative and competent treatment. I must confess that I found the book judi- 
cious to the point of dullness. The truth seems to be that economics has reached a cer- 
tain plateau of analytical competence in regard to this question, which this book 
faithfully reflects. 

One suspects, perhaps, that this is not the final word and that there are mysteries 
yet to unravel. One would like to know more, for instance, as to the role of expectation 
and disappointment in the general dynamic process. One would like to know more 
also about the subtle processes by which information and images are transmitted 
through the economic system. A complete study of inflation, for instance, would al- 
most have to have a chapter on fashions, moods and epidemics of belief. These higher 
flights of social science, however, are hardly to be expected from a meeting of twenty- 
three economists; one wonders, however, if the mixture had included a few non-econo- 
mists whether something a little more exciting might not have come out of it. How- 
ever, it is perhaps a little ungracious to judge a book for what it does not pretend to 
be; for what it is this is an admirable statement of the present state of mind on the 
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question of inflation among respectable economists, and the authors are to be con- 
gratulated on an unusually lucid exposition. 


The Theory of Marketing in Underdeveloped Countries. J. N. Chaturvedi. Allahabad, 
India: Kitab Mahal, Publishers, 1959. Pp. 135. $2.00. 


Lincotn Ciark, New York University 


HE best characteristic of this little book is its intriguing title and it merits com- 

mendation for focusing attention on a very important problem. It is the author’s 
response to Galbraith’s criticism of Indian economists for not developing “what could 
be termed an Indian Economic Philosophy.” 

Chaturvedi’s theory stems from his contention that Western economic theories, 
being based on conditions in rich countries, do not apply to poor countries. His theory 
starts by distinguishing between “real marketable surplus” and “forced market sur- 
plus.” The former is that portion of output freely released by a producer; the latter is 
what he is forced to turn over to his creditors. For the rich (U. 8.) farmer, the ratio of 
real to forced marketable surplus is very high, for the poor (Indian) farmer the ratio is 
exceedingly low. When this ratio is very low, middlemen are not beneficial agencies 
resulting from a division of labor but they are, in fact, only extortionate agencies. The 
solution, therefore, is to supplant them. Grain banks would be established in every 
village for the storage of all locally produced and consumed produce. These banks 
would issue (without charge) certificates for all deposited produce. District grain 
banks would maintain records of the amounts on deposit in the village banks and 
have equipment for transporting produce. Consumers would pay in advance 20-25% 
of their monthly income to special retailing societies. These payments would be routed 
to the grain banks for making advances to the farmers. Concurrently, these retail 
societies, located principally in urban areas, would collect information about demand 
from consumers, mills, exporters, etc. and transmit the information to district offices 
(presumably the district grain banks). The state office would equate and direct the 
district demands and supplies. This rational scheme would substantially cut distribu- 
tion costs and, since a very high percentage of the national income of poor countries 
goes for food, it would improve the national welfare. End of theory. 

Even conceding that a politically acceptable method were promulgated for price- 
setting in order to eliminate the extortionate profits of middlemen, application of the 
Chaturvedi theory would encounter great cost probiems, particularly because it would 
require a great increase in the ratio of fixed to variable costs. India, for example, has 
almost countless farmers, villages, wholesalers, retailers, and consumers, most of 
them operating on a pitifully small scale (and in this reviewer’s opinion, with extreme 
efficiency in their handling of small heterogeneous outputs and minimum capital 
equipment). The quick, in contrast to the slow introduction of very large scale market 
institutions would likely incur much higher costs in collecting, transporting, sorting, 
warehousing, and retailing the extremely vast quantities of very small heterogeneous 
outputs. 

The book presents ample data and convincing arguments that demonstrate that it 
is harder for a country to develop if it is poor than if it is rich. Our hearts can bleed for 
the plight of the poor, and we can sympathize with an innovator’s search for a massive 
quick remedy. In the very old, very populous and very poor countries, however, less 
dramatic but more effective, is likely to be the work of hundreds of innovators in- 
fluencing the actions of thousands of market institutions to change the consumption 
pattern of millions of consumers. The poor countries are going to accelerate the pace 
of economic development and are prepared to take drastic measures. The rich coun- 
tries can influence the choice—and one way is to help produce a theory of efficient 
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market development. Chaturvedi’s book does reinforce the view that a larger scale of 
operations of particular market institutions is the key. Perhaps this is the clue to the 
book editor’s question, “what is the significance of this book for statisticians?”. It 
seems to call for empirical data arranged in frequency distributions of the size of vari- 
ous types of market transactions. Such data, even without correlates, should help give 
rise to sound hypotheses for increasing the scale of operations. 


Statistics for Business Decisions. Kurnow, Glasser, and Otiman. Homewood, Illinois: 
Richard D. Irwin, 1959. Pp. xi, 523. $7.25. 


Workbook to Accompany Statistics for Business Decisions. Kurnow, Glasser, and Ottman. 
Homewood, Illinois: Richard D. Irwin, 1959. Pp. 189. $3.95. 
Desert C. Hastinas, University of Minnesota 


u1Is book represents a partial step away from the conventional “compendium of 

methods” business statistics text toward the idea of statistics as a tool in business 
decision-making. The step is only partial, however; the book is by no means a rigor- 
ous or even a general introduction to decision theory but is perhaps more aptly de- 
scribed as a recasting of conventional business statistics content in the direction of 
decision theory. This comment is not intended as criticism; the direction of change is 
probably desirable and the book may facilitate a change in content of the typical 
business statistics course which is likely to be gradual in any event. 

The scope of coverage of the book is slightly broader than comparable older books; 
elementary design of experiments, sequential sampling, control charts, for example, 
are introduced. Otherwise the instructor using the text need be prepared only for a 
slightly different order of presentation and slightly different emphasis from the usual 
text. 

A unique feature of the accompanying workbook is that all the problems and ques- 
tions center about a fictional city called Statistica. This device permits applications 
to problems of school and local government functions, sample surveys, operating 
problems of local firms, and marketing and industrial aspects of the community. The 
technique provides interest and continuity. 

Problems and examples seem to be well chosen. One small point of taste: The fad- 
dish use of “You should now know that, You should now be able to solve, and You 
will also find that” in place of “Summary, Exercises, References” quickly becomes 
tedious if not irritating. 

The National Debt Ceiling. Marshall A. Robinson. Washington: The Brookings Institution 
1959. Pp. vii, 104. $1.50. Paper. 


J. Frep Weston, University of California, Los Angeles 


HIS monograph is a gem. It is a beautiful demonstration of economic analysis ap- 
T plied to a significant issue of public policy. Statisticians will be interested in this 
study as an excellent illustration of how economic data can be used to make a dis- 
passionate analysis of an important problem, satisfying the criteria of (1) relevance, 
(2) honesty and balance, (3) rigor, and (4) good judgment. 

In a systematic fashion the national debt ceiling is analyzed under the following 
points: (1) origin, (2) characteristics, (3) the case for it, (4) the case against it, (5) the 
economics of national debt, and (6) proposal for fiscal discipline. The legal limit on 
federal borrowing was first placed in a borrowing act of 1917. In 1941 the debt limit 
in its present form was established when Congress grouped all forms of debt into one 
figure. The only change in the debt ceiling between 1941 and the time the present 
book was written was an increase in the ceiling from $65 billion to $288 billion. 

The federal debt ceiling reflects the almost irrational fear of debt in general. The 
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main argument for the ceiling is that it can be used as a device for controlling govern- 
ment spending and to control the growth of the federal debt. Other arguments pre- 
sented by the advocates of the debt ceiling are that: (1) the debt level is too high, (2) 
too costly, (3) a major source of inflation, and (4) it threatens national solvency. 
Mr. Robinson cites the names and the publications in which arguments of these kinds 
have been presented. 

The case against the debt ceiling begins with an analysis of the economics of the 
national debt. To begin, it is important to recognize that the major increases in the 
national debt have come during periods of active war such as World War I, World 
War II, and Korea. Robinson demonstrates that a rising debt during peacetime need 
not be inflationary; in fact, failure of government debt to grow appropriately can 
stunt economic growth. Furthermore, increases and decreases in the national debt 
can be useful in anti-cyclical economic policy. He assembles data to demonstrate that 
the interest costs of federal debt as a percentage of gross national product have been 
unchanged since 1945. Furthermore, no great change has developed since 1920. He 
points out that most of the federal debt is held internally rather than externally and 
that the transfer effects of debt payments have probably had little influence on either 
the distribution or level of national income. 

In view of the proper role which a normally growing federal debt has in the develop- 
ment of the economy, the harmful effects of the debt ceiling are both undesirable and 
unnecessary. The kinds of mischief that may be caused by the debt ceiling are seen in 
sharp perspective in the summer and fall of 1957, when due to the limits of the debt 
ceiling, the Air Force ceased to pay its bills. The impact of these limitations caused 
a cut-back in defense spending on the order of magnitude of $8 billion at an annual 
rate. Coming at a time when the business upswing was already waning, the sharp 
reversal in spending was sufficient to trigger the down-turn of 1957-8. Despite the 
cut-back in spending imposed by the debt ceiling, the ensuing recession resulted in a 
federal deficit of $12.4 billion during fiscal 1959, representing the largest peacetime 
deficit the country has ever incurred. Thus the irony of the debt ceiling is that it has 
caused large increases in the national debt, in turn requiring elevations in the per- 
mitted ceiling. 

Not only may the debt ceiling do the opposite of what it is intended to do, a second 
major criticism is that it leads to fiscal subterfuge. It causes the federal authorities to 
use costly and wasteful means to finance programs that Congress has approved. 
Robinson gives clear illustrations of this point. 

Robinson points out that Congress can make better progress toward fiscal disci- 
pline by replacing the debt ceiling with budgetary reforms that force Congress to 
take a comprehensive look at appropriation bills in relation to the revenue system. 
He mounts a persuasive case that rules are no substitute for understanding. Bad 
rules are especially pernicious. Robinson demonstrates that the debt ceiling repre- 
sents the substitution of ritual for intelligence and study. It is a device which has 
been used as an evasion of true fiscal responsibility. The Brookings Institution per- 
forms a most valuable public service by the support of studies of this kind and caliber. 


The International Flow of Private Capital, 1956-1958. United Nations, Department of 
Economic and Social Affairs. New York: 1959. Pp. 107. $.75. 


ARTHUR I. BLOOMFIELD, University of Pennsylvania 
HIS report, an outgrowth of the continuing study by the United Nations in the 
field of foreign investment, is a detailed factual and statistical account of the 
volume, form, and direction of private international long-term capital movements in 
recent years, with special emphasis on tlic period 1956-8. While the focus is pre- 
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dominantly on direct and portfolio investments, some brief consideration is also given 
to medium- and short-term international banking credits. The authors have carefully 
and competently assembled in convenient form, including many useful tables, a mass 
of relevant data drawn from official statistics and other sources and from the recent 
experiences of the leading capital-exporting and capital-importing countries. The 
study, however, contains virtually no analysis, makes no attempt to relate its sub- 
ject-matter to larger current international economic problems, and avoids specific 
policy prescriptions. Its brief conclusions follow directly from its factual presentation 
and are not of a controversial sort. 

Among the most important and encouraging develo; ments in this field during the 
past few years, according to the report, have been a marked increase in the outflow 
of private long-term capital from Western Europe, especially the United Kingdom 
and West Germany; a notable revival of portfolio investments, particularly new 
flotations for foreign account; an increasing diversification of the forms of direct 
investments, despite continuing concentration in extractive industries; and a higher 
over-all flow of private foreign investment—the average annual outflow of $4 billion 
for the period 1955-8 being about twice that in 1951-2. Indeed, the report states that 
international investments have in recent years entered a new phase and that the 
prospects for a greater flow of private foreign capital and for a greater contribution of 
this capital to economic development appear better than ever before. 

On the other hand, attention is called to the disturbing fact that countries with the 
lowest incomes and with no important natural resource endowment, especially in 
Africa, Asia, and the Far East, where domestic capital formation can hardly keep 
pace with population growth, still continue to be unsuccessful in attracting any 
significant volume of private foreign capital. To some degree, official grants and 
loans have corrected this unevenness in the pattern of private foreign investment. 
But the major conclusion drawn is the desirability of increasing the contribution that 
private foreign capital can make to capital formation in the less developed countries. 
On this matter, however, the report does not outline any measures whereby such an 
objective might be realized. 

Limited though the scope cf the study is, it should provide a highly useful source 
of reference to any student working in the area of current foreign investment prob- 
lems. 
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Monroe Calculating Machine Company, Inc. 
Sales and service in principal cities everywhere. General offices, Orange, N. J. 


Please mention the Journal of the American SratisticaL Association in writing advertiseis 








Careers in 


Mathematics 


Vitro’s increased activities in the field of Operations 
Research have created career opportunities for men 
with these interests and qualifications: 


MATHEMATICAL STATISTICIANS 


MS or PhD for conducting and consulting on analytical 
studies in a wide variety of applications, including informa- 
tion theory, weapons systems analysis, experimental design, 
data treatment. Should be familiar with some of the follow- 
ing—Monte Carlo procedures, Markov processes, decision 
theory, operations research, and have had 3-5 years industrial 
experience in implementing these techniques. Position is in 
the Information Analysis Group. 


OPERATIONS RESEARCH ANALYSTS 


MA or PhD in Mathematics, Statistics or Physics. Conduct 
and direct operations research studies, principally in the 
areas of weapons systems evaluation, ballistic missile de- 
jense, anti-submarine warfare and electronic countermeas- 
ures. Should have experience in some of the following areas: 
applications of game theory, linear programming, Monte 
Carlo techniques, queueing theory and model construction. 


> Our modern laboratory is located in a suburban area with 
easy access to the cultural and educational facilities of metro- 
politan New York and New Jersey. Liberal benefits include 
a@ tuition refund plan and relocation allowances. 


Send resume and salary requirements to 


Mr. S. Roberts. 





Vitra LABORATORIES 


Div. of Vitro Corp. of America 
200 Pleasant Valley Way West Orange, New Jersey 
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Careers for 


STATISTICIANS 


At Federal Electric, ITT’s worldwide service division, continuing major 
programs of electronic equipment and component performance evaluation 
provide a challenging opportunity for the statistician to gain sound ex- 
perience in industrial statistical methodology. Several highly rewarding 
positions are now open for statisticians having: a degree with major in 
mathematics or statistics; one, two or more years experience in statistical 
work, preferably in industry; some knowledge of the functions and appli- 
cations of electronic equipment helpful but not mandatory. 


Responsibilities will include: 
Analysis, interpretation and evaluation of field problems and data 
by using statistical methods and techniques. 
Study and improvement of present statistical methods of evaluating 
field failure and operations data. 


Please write Mr. S. R. Brand, Industrial Relations, stating experience and 
salary requirements. 


FEDERAL ELECTRIC CORPORATION 


Service Division of International Teleph and Telegraph Corporation 
Paramus Industrial Park, Paramus, N.J. 

















Mathematical Statisticians 


Exceptional opportunities exist at the Naval 
Weapons Laboratory for mathematical statis- 
ticians with Bachelor’s, Master’s and Doctor’s 
degrees and an interest in operations re- 
search. The principal efforts of the Opera- 
tions Research Group at present are devoted 
to the formulation and execution of extensive 
programs in the areas of target analysis, 
weapons system analysis, and missile feasi- 
bility and evaluation. Senior Statisticians on 
the staff also serve as consultants in areas of 


statistical inference, probability, and experi- 
mental design. The most advanced computing 
equipment and capable junior scientists are 
available for assistance. Starting salaries 
range from $5,880 to $11,595 per annum. The 
Naval Weapons Laboratory provides an ex- 
cellent work atmosphere and, in addition, the 
advantages of living in a pleasant small com- 
munity with economical housing and many 
recreational facilities. 


For further information, write to the Director, 
Computation and Analysis Laboratory. 


NWL 


U. S. Naval Weapons Laboratory 
Department of the Navy © Dahigren, Virginia 
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CGALIFORNIA STATE DIVISION OF HIGHWAYS 
has a limited number of key positions for 


STATISTICAL METHODS ANALYSTS 


To qualify, mathematical-statistical annual merit increases to a maximum 
skills are a requisite: a knowledge of of $6,672 or $8,112, respectively. Em- 
such mathematical methods as cal- ployment is in Sacramento. California's 
culus and matrix algebra, and of such capital affords pleasant living: the 
statistical theory as estimation, hypoth- climate is mild; outdoor recreation 


is testing, sampling, multivariate ‘ 
sso a ~s, abounds the year round; educational 
regression and correlation, and the ae 
and cultural facilities are excellent. 


design of statistical studies utilizing 
such theories. The statistical methods The civil service examination will be 
development work in these positions given in California; no prior residence 
involves the use of advanced statistical in the state is required. Applications 
techniques requiring the use of elec- must be filed on official form, by Oc- 
tronic computers, in the evaluation of tober 28, 1960. For details and appli- 
proposed highway routes date. cation form, write: State Personnel 
Starting salaries are $5,496 or $6,672, Board, 801 Capitol Avenue, Sacra- 
depending upon qualifications, with mento 14, California. 








Statisticians Wanted 


The International Labor Office has vacant posts for labour statis- 
ticians in Africa and Asia under its technical assistance programmes. 
Applications are invited from candidates with a university training in 
economics or statistics, and several years’ experience in statistical 
work, including sampling. Applicants with knowledge of languages 
other than English should state this. Duration of assignments varies 
from six months to two years. Salaries, ranging from U. S. $8,750 
to U. S. $11,000 plus substantial allowances, are tax free. Applica- 
tions giving full particulars of educational qualifications and experi- 
ence should be addressed to: 


The Director, 
Washington Branch, 
International Labor Office, 
917-15th Street, N.W., 
Washington 5, D.C. 
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Opportunities in Computer Programming 





Making a machine 
part of a system 


To start with the raw computer 
components and shape them into an 
effective computer system—this is 
Systems Programming at IBM. It’s 
a field of work that offers rich oppor- 
tunities for achievement to experi- 
enced programmers. 


In Systems Programming, you may 
have the chance to work with ad- 
vanced computers even before they 
are commercially released. You may 
be working in one or more of these 
areas: 


e Creating master control programs 

e Establishing automatic operating 
systems 

e Providing for exception cases 
(interrupt system) 


e Determining termination pro- 
cedures 


e Programming advanced computer 
consoles which can be used to 
adjust parameters. 


If you are a qualified programmer or 
mathematician and interested in 
joining professional people working 
in a professional atmosphere — with 
ample computer time to test your 
ideas — write to: 


Mr. T. E. Burns, Dept. 577U 
IBM Corporation 

2561 Strang Blvd. 

Yorktown Heights, N. Y. 


IBM: 


INTERNATIONAL BUSINESS MACHINES CORPORATION 
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In Systems Programming, you may 
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QCAS 


... covers the world literature in statistical methods, opera- 
tions research and reliability. 


. provides, in a single source, digests of all significant 
papers of interest to applied statisticians appearing in 
over 400 American and foreign periodicals. 


. includes the important new contributions appearing in 
the frequently scarce transactions of scientifiic meetings 
in the U. S. and abroad. 


QCAS furnishes a much-needed reference source for a field 
in which the volume of publication is growing rapidly. 


The abstracts are written by competent reviewers who in 
most cases are themselves contributors to the field. The ab- 
stracts are lengthy and contain original data, graphs, etc., to 
a sufficient extent to satisfy most needs. References to re- 
lated work are frequently appended. Each article is coded for 
convenience in filing. An annual index is provided. 











QUALITY CONTROL 

AND APPLIED STATISTICS 
including Operations Research 
and Reliability 


12 issues ° 1400 loose-leaf pages 
600 digests ° $90 per calendar year 


For a free sample issue of QCAS, write to: 


QCAS Abstracts— 


INTERSCIENCE PUBLISHERS, INC. 


250 Fifth Avenue, New York 1, N. Y. 
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IMPORTANT MACMILLAN TEXTBOOKS ON STATISTICS . . . 





INTRODUCTION TO MATHEMATICAL STATISTICS 


ROBERT V. HOGG and ALLEN T. CRAIG, both of the University of Iowa 
The Allendoerfer Mathematics Series 


Concentrates on basic concepts, particularly on the concept of a sufficient statistic. 
Answer book available gratis 1960, 245 pp., $6.75 


INTRODUCTION TO PROBABILITY AND STATISTICS 

B. W. LINDGREN and G. W. McELRATH, both of the Institute of Technology, 
University of Minnesota 

Presents classical and modern statistical methods, with a thorough preliminary treatment of 
probability. Answer book available gratis 1959, 277 pp., Ill., $6.25 


AN INTRODUCTION TO EXPERIMENTAL DESIGN 


WILLIAM S. RAY, Woman’s College, University of North Carolina 
Central issues involved in the design and evaluation of experiments examined according to 
the different analyses of variability. 1960, 254 pp., Ill., $6.50 


INTRODUCTION TO MODERN STATISTICS 


With Applications to Business and Economics 
WERNER Z. HIRSCH, Washington University 


For students with only elementary college algebra—designed to develop proficiency in the 
use of modern techniques of statistical inference. 1957, 429 pp., Ill., $6.50 


The Macmillan Company 0 ritth avenue, New York 11, N.Y, 











BUSINESS STATISTICS 


By Dr. John R. Stockton 
The University of Texas 


Here is a skillfully written book that stresses the value of statistical 
analysis in the solution of business problems, The mathematics needed 
by the student is explained as the principles of analysis and the formulas 
are presented. Many charts are used to illustrate the various uses of the 
gtaphic method. 


The problem material in BUSINESS STATISTICS is outstanding, both 
in quality and quantity. Twenty comprehensive problems are provided in 
the Laboratory Problems for BUSINESS STATISTICS. 


SOUTH-WESTERN PUBLISHING CO. 


(Specialists in Business and Economic Education) 
Cincinnati 27 New Rochelle, N.Y. Chicago 5 San Francisco 3 Dallas 2 
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NEW! 


M. H. Quenouille: RAPID STATISTICAL CALCULATIONS 


Prepared for quick reference, this handy compendium offers a collection of dis- 
tribution free and easy methods of estimation and testing. Included is a “per- 
centage efficiency” figure for each system. Methods and comments, and a worked 
example are shown on opposite pages. Contains 9 valuable tables and a bibliog- 
raphy. Size 6 x 4 inches. $1.65 


E. Lukacs: CHARACTERTISTIC FUNCTIONS 


A work of pure mathematics for use in advanced statistical courses. It explains 
characteristic functions for their intrinsic interest. 212 pages (Double Volume). 


$6.45 
E. A. Robinson: INFINITELY MANY VARIATES 


This monograph gives a precise presentation of probability theory, limit theorems, 
and stationary stochastic processes. Many proofs and applications are outlined or 
shown as exercises. 132 pages, bibliography. $4.75 


STECHERT-HAENER, INC. 


FOUNDED IN NEW YORK 1872 


ie Whrld % oLeading Sriternattenal Booksellers 


31 EAST 10th STREET, NEW YORK 3, N. Y. 











JOURNAL OF FARM ECONOMICS 
Published by 
THE AMERICAN FARM ECONOMIC ASSOCIATION 


Editor: Herman M. SoutnwortH 
The Pennsylvania State University, University Park, Pennsylvania 


Volume XLII August, 1960 Number 3 





The Analysis of Changes in Agricultural Supply: Problems and Approaches .. 
Mare Nerlove and Kenneth L. Bachman 


Coordination and Vertical Expansion in Marketing Cooperatives 
John A. Jamison 


A Framework for Rural Development ........-s.cecccccscees J. KE. McDermott 
Grain Storage Rules Conrad Gislason 


Computation of Variance Estimates for Marginal Physical Products and Mar- 
ginal Rates of Substitution John P. Doll, Emil H. Jebe, and Robert D. Munson 
A “Unit Survey” Census in Southern Peru John C. Hobbes 
Land Accumulation in the Turkish Cukurova John Hiltner 
Reclamation Under the Desert Land Act, 1954-59 Karl S. Landstrom 
Additional articles, notes, book reviews, and announcement of new bulletins and 
other publications in agricultural economics. 


Published in February, May, August, November, and December. $9 yearly. 


Secretary-Treasurer: C. Del Mar Kearl 


Department of Agricultural Economics 
Cornell University, Ithaca, New York 
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PUBLICATIONS IN OPERATIONS RESEARCH 
SPONSORED BY 
THE OPERATIONS RESEARCH SOCIETY OF AMERICA 


Monographs are requested for consideration for pub- 
lication in the Operations Research series, sponsored by 


The Operations Research Society of America. 


We are particularly interested in explorations and 
analytical work of a nature which might ordinarily not 
be published. 


Details will be furnished on request to: 
Dr. David B. Hertz, 
Associate Editor for Publications, 


The Operations Research Society 


of America, 
c/o Arthur Andersen & Co., 
80 Pine Street, 
New York 5, New York. 


Published by 


John Wiley & Sons, Inc. 
440 Park Avenue South, NYC 16 
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BIOMETRICS 


Journal of the Biometric Society 
Vol. 16, No. 3 CONTENTS September 1960 


Maximum Likelihood Estimation of Genetic Components of Variation B. I. Hayman 
An Expository Note on Estimation of Simultaneous Structural Equations R. L. Basmann 


On Preliminary Sampling with Special Reference to the Microbiological Control of 
Tomato Concentrates O. Elteto and K. Sarkadi 


The Treatment of Reciprocal Interaction, with or without Lag, in Path Analysis 
Sewall Wright 


Estimation of Regression Slope from Tail Regions with Special Reference to the 
Volume Line M. L. Dudzinski 


Polychotomous Quantal Response in Biological Assay 
John Guzland, 'lbok Lee, and Paul A. Dahm 


Optimum Allocation in Regression Experiments with Two Components of Error 
Robert M. DeBaun and Victor Chew 


An Extension of a Truncated Poisson Distribution A. Clifford Cohen, Jr. 
The Analysis of a Catch Curve D. G. Chapman and D. S. Robson 


A Note on the Additive Partitioning of Chi-Square in Contingency Tables 
Marvin A. Kastenbaum 


Experimental Design in the Estimate of Heritability by Regression Methods 
B. D. H. Latter and Alan Robertson 


Efficiency of Animal Testing Schemes Charles Smith 

Queries and Notes 

The Question of Stability with Positive Feedback Everett R. Dempster 

Straight Line Regression Through the Origin Malcolm E. Turner 
Biometrics is published quarterly. Its objects are to describe and exemplify the use of 

mathematical and statistical methods in biological and related sciences, in a form assimil- 


able by experimenters. The annual non-member subscription rate is $7. Inquiries, orders 
for back issues and non-member subscriptions should be addressed to: 


BIOMETRICS 
Department of Statistics 
The Florida State University 
Tallahassee, Florida 
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The Annals of Mathematical Statistics 


THE OFFICIAL JOURNAL OF THE INSTITUTE OF 
MATHEMATICAL STATISTICS 


—_s 
— 


Vol. 31, No. 3—September, 1960 
Contents 
ee I, SI ink nk cas dane coussade chesncewaasieed ..J. R. Blum 


The Capacities of Certain Channel Classes Under Random Coding .....--. 
David Blackwell, Leo Breiman and A. A Thomasian 


On the Estimation of the Spectrum of a Stationary Stochastic Process K. R. Parthasarathy 
Expectation of Functionals on a Stochastic Process ...Edward Nelson and Dale Varberg 
Ergodicity of Queues in Series ....... 6.064 6000.06.0600-060006000004neseel 

Gennes Sow o Rie Cocie Teale BAG oo. oe.ssccccrccaresssicsccegsssaans C. F. Newell 
Probability Content of Regions Under Spherical Normal Distribution, I ..Harold Ruben 
Graphs for Bivariate Normal Probabilities Marvin Zelen and Norman C. Severo 


Charts of Some Upper Percentage Points of the Distribution of the Largest Character- 
CE SD 06d.940.06.0-000500006Kineparecensbabenaeenbieenansees Pry 2 ae 


Correction for Bias Introduced by a Transformation of Variables 
Jerzy Neyman and Elizabeth L. Scott 


Ga o Beale of F. Tein GHEE, GEE oooccccsncrcccsekancdavceesss L. Schmetterer 


Estimates With Prescribed Variance Based on Two-Stage Samplin 
Allan Birnbaum an 


A Monotonicity Property of the Sequential Probability Ration Test ...Robert A. Wiisman 
Locally Most Powerful Rank Tests for Two-Sample Problems Hirofumi Uzawa 
On the Structure of Distribution-Free Statistics ..........++ee0. errr rryy C. B. Bell 


Small Sample Distributions for Multi-Sample Statistics of the Smirnov Type 
. W. Birnbaum aad R. A. Hall 


Polya Type Distribution of Convolutions Samuel Karlin and Frank Proschan 


The Poisson Approximation to the Poisson Binomial Distribution 
J Hodges, Jr. and Lucien LeCam 


Confidence Bounds Connected with ANOVA and MANOVA for Balanced and Par- 
tially Balanced Incomplete Block Designs V. P. Bhapkar 


A Class of Factorial Designs with Unequal Cell-Frequencies Giedon Schwarz 
A Generalization of Group Divisible Designs ..... jtbeedvbdoeda Damaraju Raghavarao 
Relations Among the Blocks of the Kronecker Product of Designs Manohar Narbar Vartak 
The Dual of a Balanced Incomplete Block Design 


NOTES: 
On the Unbiasedness of Yates’ Method of Estimation Using Interblock Information 
Franklin Graybill and V. Seshadri 


On the Block Structure of Certain PBIB Designs with Two Associate Classes Having 
Triangular and L Association Schemes Damaraju Raghavarao 


Optimality Criteria for Incomplete Block Designs ............++++ oveeesene 


On the Completeness of Order Statistics 
pan.sos nee badenébnkpeeaamnaiaae C. B. Bell, David Blackwell and Leo Breiman 


On Centering Infinitely Divisible Processes Ronald Pyke 
The Strong Law of Large Numbers for a Class of Markov Chains Leo Breiman 
Emptiness in the Finite Dam .......-+ceseesees sévebseecenad Hbeceeseeseed A, Ghosal 
Correction to ‘‘The Individual Ergodic Theorem of Information Theory ....Leo Breiman 
Correction Notes 

Abstracts of Papers 

News and Notices 

Publications Received 





— 
—_ 





Address orders for subscriptions and back numbers to Professor George E. Nichol- 
son, Jr., Secretary, Institute of Mathematical Statistics, Department of Statistics, 
University of North Carolina, Chapel Hill, North Carolina. 
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JOURNAL OF THE ROYAL STATISTICAL SOCIETY 


THE JOURNAL OF THE ROYAL STATISTICAL SOCIETY is published in 
two series: SERIES A (GENERAL), four issues a year, 20s each part, annual 
subscription. £4 4s (U.S. $12) post free; SERIES B (METHODOLOGICAL), 
two issues a year, 30s each part, annual subscription £3 2s (U.S. $9) post free. 


Contents of Series A, Vol. 123, Part 3, 1960 


The Economics of Building Designs. By P. A. Stone. (With Discussion). 


The Present Position of Econometrics. A Discussion opened by Joan Robinson, J. Downie and 
C. B. Winsten. 


Some Experiments in Methods of Conducting Family Expenditure Surveys. By W. F. F. Kemsley 
and J. L. Nicholson. 


Notes on Sequential Sampling Plans. By F. J. Anscombe. 
Wholesale Prices in 1959. By The Editor of The Statist. 


Reviews, Statistical Notes, Additions to Library. 


Contents of Series B, Vol. 22, No. 2, 1960 


Models in the Analysis of Variance. By R. L. Plackett. (With Discussion). 


Discrete Stochastic Processes in Population Genetics. By W. F. Bodmer. (With Discussion). 
A Problem of Delayed Service—I. By A. Ben-Israel and P. Naor. 

A Problem of Delayed Service-——II. By A. Ben-Israel and P. Naor. 

On the Transient Behavior of a Simple Queue, By P. D. Finch. 

Queueing at a Single Serving Point with.Group Arrival. By B. W. Conolly. 

On Some Extensions of Bayesian Inference Proposed by Mr. Lindley. By R. A. Fisher. 


A Method of Estimation of Missing Values in Multivariate Date Suitable for Use with an 
Electronic Computer, By S. F. Buck. 


Estimation of Parameters of a Multivariate Normal Population from Truncated and Censored 
Populations. By Naunihal Singh. 


Necessary Restrictions for Distributions a posteriori. By D. A. Sprott. 


Weight of Evidence, Corroboration, Explanatory Power, Information and the Utility of Ex- 
periments. By I. J. Good. 


The Logistic Process: Tables of the Stochastic Epidemic Curve and Applications. By Edwin 
Mansfield and Carlton Hensley. 


Some Notes of Pistimetric Inference. By A. D. Roy. 


Evaluation of Determinants, Characteristic Equations and their Roots for a Class of Patterned 
Matrices. By S. N. Roy, B. G. Greenberg and A. E. Sarhan. 


Bounds for the Expected Sample Size in a Sequential Probability Ratio Test. By M. N. Ghosh. 
A Note on Tests of Homogeneity Applied after Sequential Sampling. By D. R. Cox. 

The Interaction Algorithm and Practical Fourier Analysis: an Addendum, By I. J. Good. 
Approximate Solutions of Green’s Type for Univariate Stochastic Processes. By H. E. Daniels. 
The Wilcoxon Test and Non-null Hypotheses. By G. B. Wetherill. 


Obtainable through any bookseller or direct from the Royal Statistical Society, 21 Benlinck 
Street, London W. 1. 
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POPULATION STUDIES 


A Journal of Demography 
Edited by D. V. Grass and E. GreBeNniIk 
Vol. XIV, No. 1 CONTENTS July 1960 


MICHAEL K. ROOF. The Russian Population Enigma Reconsidered. 


GALINA V. SELEGEN. The First Report on the Recent Population Census in 
the Soviet Union. 


IRENE B. TAEUBER. Japan’s Demographic Transition Re-Examined. 


R. G. POTTER, JR. Some Comments on the Evidence Pertaining to Family Limi- 
tation in the United States. 


S. H. COUSENS. Regional Death Rates in Ireland During the Great Famine from 
1846 to 1851. 


Book Reviews. 


Subscription price per volume of 3 parts 42s. net, post free 
(or American currency $6.75). 
Single parts £1 each plus postage (American $3.25, post free). 
Published by the POPULATION INVESTIGATION COMMITTEE, at the LONDON SCHOOL 
OF ECONOMICS and POLITICAL SCIENCE, 15 HOUGHTON STREET, LONDON, W.C.2. 











ESTADISTICA 


Journal of the 
Inter American Statistical Institute 


Vot. XVIII, No. 66 Marcu 1960 
CONTENTS 


Ndmeros Indices con Ponderaciones Evolutivas Fausto I. Toranzos 


Algunas Observaciones Sobre el Censo Agropecuario Mundial de 1960 (traduccié6n) 
P. C. Mahalanobis 


El Censo Agropecuario Mundial de 1960 (traduccién) P. V. Sukhatme 
Algunos Métodos Gr&ficos en la Estadistica Jorge Arias B. 


Programa de Evyaluacién e Investigacién de los Resultados de los Censos de 1960 
(traduccién) Morris H. Hansen, Leon Pritzker, Joseph Steinberg 


Cancer Mortality and Prevalence in Chile Juan Moréder and Jerjes Vildésola 
Caractéristiques de la Population Haitienne Jacques Vilgrain 


Legal Provisions. International Resolutions Relating to Statistics. Institute Affairs. 
Statistical News. Publications. 


Published quarterly Annual subscription price $3.00 (U.S.) 


Inter American Statistical Institute 
Pan American Union, Washington 6, D.C. 
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